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Polymer Evolution Via Templated Synthesis 

RELATED APPLICATIONS 



[0001] This application claims the benefit of and priority to U.S. Patent Applications Serial 
No. 60/688, 165, filed June 07, 2005, the entire disclosure of which is incorporated. by reference 
5 herein for all purposes. 

Field of the Invention 

[0002] The invention relates generally to polymer synthesis, and more particularly relates to 
evolutionary polymer synthesis by nucleic acid-mediated chemistry. 

Background of the Invention 

10 [0003] Directed evolution minimally requires (/) the high-fidelity translation of a replicable 
informiation carrier such as DNA into the evolving molecules, (//) a stable linkage between the 
translated molecules and their encoding information carriers, (iif) a selection that separates 
functional molecules fi*om non-functional variants, and (iv) the mutation, re-translation, and re- 
selection of molecules surviving the initial selection. Although ribosomes or polymerase 

15 enzymes are used to meet the first requirement during protein or nucleic acid evolution, these 
enzymes cannot be used to generate synthetic polymers that are not close analogues of DNA, 
RNA, or proteins. 

[0004] Under certain circumstances it may be helpful to produce polymers having a desirable 
property. These properties may include, for example, improved catalytic activity, improved 
20 binding activity, improved stability, improved solubility and the like. The improvement of these 
properties has been difficult to achieve using conventional chemistries. Accordingly, there is an 
ongoing need for new methods of making polymers with the desired properties. 

Summary of the Invention 

[0005] The invention provides a variety of methods and compositions that expand the scope 
25 of template-directed synthesis, selection, amplification and evolution of molecules of interest. In 
particular, the invention provides an in vitro method for evolving synthetic polymers of interest 
using nucleic acid templated chemistry. Using the approaches described herein, a skilled artisan 
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may produce polymers having an improved property of interest, for example, catalytic activity, 
biological activity, or binding activity to a target of interest. 

[0006] In one aspect, the invention provides an in vitf'o method for evolving a polymer 
having a particular property. The method comprises the steps of: (a) producing a mixture of 
5 polymers, wherein each polymer is associated with a template oligonucleotide that encoded its 
synthesis; (b) selecting from the mixture produced in step (a) a polymer having a particular 
property, wherein the selected polymer is associated with the template that encoded its synthesis; 
(c) obtaining information about the sequence of the template associated with the polymer 
selected in step (b); (d) producing a plurality of evolved templates, each of which differs by at 
10 least one base from the template associated with the polymer selected in step (b); (e) producing 
a mixture of evolved polymers using the evolved templates, wherein each evolved polymer is 
associated with the template that encoded its synthesis; and (f) selecting from the mixture 
produced in step (e) an evolved polymer having the particular property. 

[0007] In another aspect, the invention provides an in vitro method for evolving a polymer 

15 having a particular property. The method comprises the steps of: (a) combining (i) a plurality of 
different templates, wherein each template comprises a first codon and a second codon, with (ii) 
a plurality of transfer units, at least one of which comprises a monomeric subunit associated with 
an oligonucleotide having a first anti-codon capable of annealing to the first codon of a given 
template and at least one of which comprises a different monomeric subunit associated with an 

20 oligonucleotide comprising a second anti-codon capable of annealing to the second codon of a 
given template under conditions to permit transfer units to anneal to a particular template and to 
permit at least one monomer subunit to become covalently linked to a different monomer subunit 
to produce a polymer associated with the template that encoded its synthesis; (b) selecting a 
polymer having a particular property, wherein the polymer remains associated with the template 

25 that encoded its synthesis; (c) obtaining sequence information about the template associated 

with the polymer selected in step (b); (d) obtaining a plurality of evolved templates that contain 
a codon that differs by at least one base from the template associated with the polymer selected 
in step (b); (e) combining (i) the plurality of evolved templates with (ii) a plurality of transfer 
units under conditions to permit transfer units to anneal to a particular template and to permit a 

30 first monomer subunit to become covalently linked to a second monomer subunit to produce an 
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evolved polymer associated with the evolved template that that encoded its synthesis; and (e) 
selecting an evolved polymer having the particular property. 

[0008] In step (a), the polymer can be covalently attached to the template oligonucleotide. 
Furthermore, in step (e), the polymer can be covalently attached to the template oligonucleotide. 
5 It is contemplated that the method of the invention may be used to develop a variety of different 
polymers. It is understood, however, that the claimed method may be useful in evolving peptidyl 
nucleic acid (PNA) polymers. 

[0009] Under certain circumstances, for example, in order to perform a selection process that 
exploits binding activity, it may be advantageous to permit the polymer to fold. For example, 
10 when the polymer is a PNA polymer, it may be advantageous to disassociate the PNA polymer 
from the template that encoded its synthesis prior to selection. This can be achieved by using an 
oligonucleotide complementary to the template to disassociate the polymer from the template. 

[0010] In another aspect, the invention provides a method of selecting a polymer capable of 
binding to a target molecule. The methods comprises the: (a) combining a plurality of polymers 

15 associated with oligonucleotide templates that encoded their synthesis with a solid support 

having the target molecule disposed thereon under conditions to permit polymers to bind to the 
target molecule; (b) removing unbound polymers; (c) disassociating the bound polymers from 
the solid support to produce a first fraction enriched for polymers that bind to the target 
molecule; (d) combining the disassociated polymers with a fresh solid support having the target 

20 molecule disposed therein under conditions to permit polymers to bind to the target molecule; (e) 
removing unbound polymers; and (f) disassociating the bound polymers from the solid support to 
provide a second fraction enriched for polymers that bind to the target molecule, wherein the 
second fraction contains a greater proportion of polymers that bind to the target than the first 
fraction. 

25 [0011] Improved yields of enriched polymer can be obtained by, for example, in step (d), 

combining fresh solid support with the disassociated polymer in the presence of the solid support 
used in step (a). This approach obviates the step of separating the selected polymer from the 
solid support. This, therefore, reduces losses that can be incurred when the polymer is harvested 
and transferred to another container. This approach can be helpful when the polymer is a PNA 

30 molecule. 
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Definitions 

[0012] The term, "associated with" as used herein describes the interaction between or among 
two or more groups, moieties, compounds, monomers, etc. When two or more entities are 
"associated with" one another as described herein, they are linked by a direct or indirect covalent 
5 or non-covalent interaction. Preferably, the association is covalent. The covalent association 
may be, for example, but without limitation, through an amide, ester, carbon-carbon, disulfide, 
carbamate, ether, thioether, urea, amine, or carbonate linkage. The covalent association may also 
include a linker moiety, for example, a photocleavable linker. Desirable non-covalent 
interactions include hydrogen bonding, van der Waals interactions, dipole-dipole interactions, pi 
10 stacking interactions, hydrophobic interactions, magnetic interactions, electrostatic interactions, 
etc. 

[0013] The terms, "polynucleotide," "nucleic acid", or "oligonucleotide" as used herein refer to a 
polymer of nucleotides. The polymer may include, without limitation, natural nucleosides (/.e., 
adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxj^hymidine, 

15 deoxyguanosine, and deoxycytidine), nucleoside analogs (e.g., 2-aminoadenosine, 2- 
thiothymidine, inosine, pyrrolo-pyrimidine, 3 -methyl adenosine, 5-methylcytidine, 
C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, 
C5-propynyl-cytidine, C5-methylcytidine, 7-deazaadenosine, 7-deazaguanosine, 
8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2-thiocytidine), chemically modified 

20 bases, biologically modified bases (e.g.^ methylated bases), intercalated bases, modified sugars 
(e.g., 2'-fluororibose, ribose, 2 -deoxyribose, arabinose, and hexose), or modified phosphate 
groups (e.g., phosphorothioates and 5* -N-phosphoramidite linkages). Nucleic acids and 
oligonucleotides may also include other polymers of bases having a modified backbone, such as 
a locked nucleic acid (LNA), a peptide nucleic acid (PNA), a threose nucleic acid (TNA) and 

25 any other polymers capable of serving as a template for an amplification reaction using an 
amplification technique, for example, a polymerase chain reaction, a ligase chain reaction, or 
non-enzymatic template-directed replication. 

[0014] The term, "transfer unit" as used herein, refers to a molecule comprising an 
oligonucleotide having an anti-codon sequence associated with a monomer subunit useful in 
30 template mediated polymer synthesis. 
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[0015] The term, "template" as used herein, refers to a molecule comprising an oligonucleotide 
having at least one codon sequence suitable for a template mediated chemical synthesis. The 
template optionally may comprise (i) a plurality of codon sequences, (ii) an amplification means, 
for example, a PGR primer binding site or a sequence complementary thereto, (iii) a reactive unit 
5 associated therewith, (iv) a combination of (i) and (ii), (v) a combination of (i) and (iii), (vi) a 
combination of (ii) and (iii), or a combination of (i), (ii) and (iii). 

[0016] The terms, "codon" and "anti-codon" as used herein, refer to complementary 
oligonucleotide sequences in the template and in the transfer unit, respectively, that permit the 
transfer unit to anneal to the template during template mediated chemical synthesis. 

10 [0017] Throughout the description, where compositions are described as having, including, or 
comprising specific components, or where processes are described as having, including, or 
comprising specific process steps, it is contemplated that compositions of the present invention 
also consist essentially of, or consist of, the recited components, and that the processes of the 
present invention also consist essentially of, or consist of, the recited processing steps. Further, 
15 it should be understood that the order of steps or order for performing certain actions are 

immaterial so long as the invention remains operable. Moreover, two or more steps or actions 
may be conducted simultaneously. 

Description of the Drawings 

[0018] Figure 1 is a schematic representation of nucleic acid mediated polymer synthesis. 

[0019] Figure 2 depicts a schematic representation of m vitro synthetic polymer evolution 
using nucleic acid-templated organic synthesis. 

[0020] Figure 3A depicts DNA-templated polymerization, in which ten consecutive DNA- 
templated reductive amination couplings resulted in PNA 40-mers containing secondary amine 
linkages between every fourth nucleotide. The efficiency of this process for all nine PNA 
aldehydes of the form H2N-gvvt-CHO (where v = a, c, or g) to generate predominantly full 
length synthetic polymer products is shown by denaturing PAGE. Figure 3B depicts an 
experiment in which an equimolar mixture of nine gvvt building blocks was combined with a 
library of 3.5 x 10^ DNA templates to yield a library of PNA covalently linked to their DNA 
templates, as shown by denaturing PAGE. The library polymerization reactions contained 4.4, 
8.8, or 16.2 equivalents of all nine gvvt PNA aldehyde building blocks. 



20 
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[0021] Figure 4 depicts in vitro selection of a synthetic polymer library for papain binding 
activity. Figure 4 A shows displacement of the PNA strand from translated PNA-DNA 
conjugates (see Figure 2). Figure 4B depicts an agarose gel electrophoresis of PCR-amplified 
DNA templates surviving each of four successive rounds of papain affinity selection. Figure 4C 
5 depicts template sequences resulting from in vitro selection of the PNA library. The variable 
bases of the most highly represented sequence (PI) emerging from the selection are outlined. 
Figure 4D depicts a hypothetical model of PI secondary structure based on the predicted 
structure of the corresponding RNA at 1 M simulated salt concentration. 

[0022] Figure 5 depicts enrichment assays in which synthetic polymers were selected using 
10 papain affinity selection. Figure 5 A depicts papain affinity selection of the translation products 
of a 500:1 mixture of Ml (mutant):? 1 (selected sequence) templates, and of a 500:1 mixture of 
Ul (unrelated) :P1 templates. Based on the unique restriction digest patterns of Ml, PI, and Ul, 
the selection enriches the PI translation product -500-fold, but not the Ml or Ul products. 
Figure SB depicts the results of a fluorescence polarization-based papain binding assay of 
15 truncated PI PNA (data points connected by a solid line) versus Ml PNA or PI DNA or BFL- 
amine alone (data points for each are not connected by a line), all prepared by solid-phase 
synthesis and labelled at their amino termini with BODIPY-fluorescein (BFL) succinimidyl 
ester. This graph shows the ability of the selected synthetic polymer to bind papain in the 
absence of its DNA template. Figure 5C depicts results of binding assays of the BFL-labelled 
20 truncated PI PNA with papain (data points connected by a solid line) versus with lysozyme or 
with trypsin (data points for each are not connected by a line), and reveals that the PI PNA does 
not bind lysozyme or trypsin with detectable affinity. Error bars in Figures SB and 5C represent 
standard deviations of three or more independent trials, 

[0023] Figure 6 depicts an experiment in which diversification of PI, retranslation, and 
25 reselection yields P2, a synthetic polymer with improved papain affinity. Figure 6A shows the 
five positions (corresponding to 'v' in 2nd library) at which PI was diversified into a second- 
generation library. The clones emerging from papain affinity selection of the second library are 
shown. The most highly represented sequence (P2) converges on the sequences outlined. 
Figure 6B depicts the origins and hypothetical secondary structure model of the P2 PNA, 
30 Figure 6C depicts the results of a fluorescence polarization-based papain binding assay of BFL- 
labelled truncated P2 PNA (data points connected by a solid line) compared with BFL-labelled 
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truncated PI PNA (data points connected by a dashed line) and BFL-amine alone (data points 
not connected by a line). Figure 6D depicts the results of a binding assay in which P2 was pre- 
incubated with 10 equivalents of either a complementary DNA 18-mer (data points connected by 
a dashed line) or a control DNA 18-mer containing the P2-complementary codons in a scrambled 
5 order (data points connected by a solid line), showing that P2 binding to papain is inhibited by 
DNA complementary to P2 . Error bars in Figure 6C and 6D represent standard deviations of 
three or more independent trials. 

[0024] Figure 7 depicts typical PAGE results showing PNA displacement of an individual 
translated double-hairpin template. Addition of the restriction endonuclease Sph I exclusively 
10 cleaves double-stranded DNA in the template (lane 1 = template x Sph I; lane 2 = "filled in" 

template using DNA polymerase x Sph I; lane 3 = translation product x Sph I; lane 4 = translated 
and displacement product x Sph I). The presence of the fast-running band in both lane 2 and lane 
4 represents cleaved double-stranded DNA in both the positive control (lane 2) and in the 
translated and displaced product (lane 4). 

15 [0025] Figure 8 is a schematic representation of an approach for reducing out-of-frame 
codon and anti-codon annealing during templated synthesis. 

[0026] Figure 9 shows exemplary coupling chemistries useful in nucleic acid-mediated 
polymerization reactions. 

Description of Certain Embodiments of the Invention 

[0027] Nucleic acid templated synthesis as described herein permits the production, 
selection, amplification and evolution of a broad variety of non-natural polymers. The invention 
is particularly useful for synthesizing non-naturally occurring polymers. For example, the 
invention can be used to synthesize non-biological polymers (e.g., polymers other than DNA, 
RNA, or protein). In nucleic acid-mediated synthesis, the information encoded by a DNA or 
other nucleic acid sequence is translated into the synthesis of a reaction product. The nucleic 
acid template typically comprises a plurality of coding regions which anneal to complementary 
anti-codon sequences associated with reactive units, thereby bringing the reactive units together 
in a sequence-specific manner to create a reaction product. Since nucleic acid hybridization is 
sequence-specific, the result of a nucleic acid-templated reaction is the translation of a specific 
nucleic acid sequence into a corresponding reaction product. 



20 
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[0028] A general scheme for polymer synthesis is presented in Figure 1. As shown in 
Figure 1, in this approach the template can bring together, either simultaneously or sequentially, 
a plurality of transfer units in a sequence-specific manner. The reactive units on each annealed 
transfer unit can then be reacted with one another in a polymerization process to produce a 
5 polymer. Using this approach it is possible to generate a variety of non-natural polymers. The 
polymerization may be a step-by-step process or may be a simultaneous process whereby all the 
annealed monomers are reacted in one reaction sequence 

[0029] A proposed approach for the evolutionary synthesis of polymers is set forth in Figure 
2. Information transfer ^om the nucleic acid template to the polymer occurs by template- 
10 directed syntheses. 

[0030] Initially, a plurality of different oligonucleotide templates are provided. When 
transfer units are combined with the templates under conditions for DNA-templated 
polymerization, a plurality of polymers are produced, each of which is associated with the 
template that encoded its synthesis (1). Thereafter, the synthetic polymer is disassociated from 

15 the template via a DNA polymerase that synthesizes a strand complementary to the template (2), 
As shown, the synthetic polymer becomes substantially disassociated from the template that 
encoded its synthesis but yet still remains attached, for example, covalently attached, to the 
template. In this example, Watson and Crick-type base pairing between the synthetic polymer 
and the template is reduced or eliminated. As a result of strand displacement, the polymer is 

20 permitted to fold (3). The polymers that bind to a particular target are selected to provide a 
population of polymers that bind to the target (4). The templates associated with the selected 
polymer then are amplified, for example, via polymerase chain reaction (PGR) (5). The 
amplified sequence can then be sequenced to identify and/or show the synthetic history of the 
polymer of interest (6). Furthermore, the amplified template can be mutated to give another 

25 population of templates that can be used for another round of polymer synthesis and selection 
(7). 

I. TEMPLATE CONSmERATIONS 

[0031] The nucleic acid template can direct a wide variety of chemical reactions without 
obvious structural requirements by sequence-specifically recruiting reactants linked to 
30 complementary oligonucleotides. As discussed, the nucleic acid mediated format permits 

reactions that may not be possible using conventional synthetic approaches. During synthesis. 
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the template hybridizes or anneals to one or more transfer units to direct the synthesis of a 
reaction product, which during certain steps of templated synthesis remain associated with the 
template. A reaction product then is selected or screened based on certain criteria, such as the 
ability to bind to a preselected target molecule. Once the reaction product has been identified, 
5 the associated template can then be sequenced to decode the synthetic history of the reaction 
product. Furthermore, as will be discussed in more detail below, the template may be evolved to 
guide the synthesis of another chemical compound or library of chemical compounds. 

(i) Template Format 

[0032] The template may incorporate a hairpin loop on one end terminating in a reactive unit 
10 that can interact with one or more reactive units associated with transfer units. For example, a 
DNA template can comprise a hairpin loop terminating in a 5 '-amino group, which may or may 
not be protected. The amino group may act as an initiation point for formation of an unnatural 
polymer. 

[0033] The length of the template may vary greatly depending upon the type of the nucleic 
15 acid4emplated synthesis contemplated. For example, in certain embodiments, the template may 
be from 10 to 10,000 nucleotides in length, from 20 to 1,000 nucleotides in length, from 20 to 
400 nucleotides in length, from 40 to 1,000 nucleotides in length, or from 40 to 400 nucleotides 
in length. The length of the template will of course depend on, for example, the length of the 
codons, the complexity of the library, the complexity and/or size of a reaction product, the use of 
20 spacer sequences, etc, 

(ii) Codon Usage 

[0034] It is contemplated that the sequence of the template may be designed in a number of 
ways without going beyond the scope of the present invention. For example, the length of the 
codon must be determined and the codon sequences must be set. If a codon length of two is 

25 used, then using the four naturally occurring bases only 16 possible combinations are available to 
be used in encoding the library. If the length of the codon is increased to three (the number 
Nature uses in encoding proteins), the number of possible combinations increases to 64. If the 
length of the codon is increased to four, the number of possible combinations increases to 256. 
Other factors to be considered in determining the length of the codon are mismatching, frame- 

30 shifting, complexity of library, etc. As the length of the codon is increased up to a certain point 
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the number of mismatches is decreased; however, excessively long codons likely will hybridize 
despite mismatched base pairs. 

[0035] Although the length of the codons may vary, the codons may range from 2 to 50 
nucleotides, from 2 to 40 nucleotides, from 2 to 30 nucleotides, from 2 to 20 nucleotides, from 2 
5 to 15 nucleotides, from 2 to 10 nucleotides, from 3 to 50 nucleotides, from 3 to 40 nucleotides, 
from 3 to 30 nucleotides, from 3 to 20 nucleotides, from 3 to 15 nucleotides, from 3 to 10 
nucleotides, from 4 to 50 nucleotides, from 4 to 40 nucleotides, from 4 to 30 nucleotides, from 4 
to 20 nucleotides, from 4 to 15 nucleotides, from 4 to 10 nucleotides, from 5 to 50 nucleotides, 
from 5 to 40 nucleotides, from 5 to 30 nucleotides, from 5 to 20 nucleotides, from 5 to 15 

10 nucleotides, from 5 to 10 nucleotides, from 6 to 50 nucleotides, from 6 to 40 nucleotides, from 6 
to 30 nucleotides, from 6 to 20 nucleotides, from 6 to 15 nucleotides, from 6 to 10 nucleotides, 
from 7 to 50 nucleotides, from 7 to 40 nucleotides, from 7 to 30 nucleotides, from 7 to 20 
nucleotides, from 7 to 15 nucleotides, from 7 to 10 nucleotides, from 8 to 50 nucleotides, from 8 
to 40 nucleotides, from 8 to 30 nucleotides, from 8 to 20 nucleotides, from 8 to 15 nucleotides, 

15 from 8 to 10 nucleotides, from 9 to 50 nucleotides, from 9 to 40 nucleotides, from 9 to 30 
nucleotides, from 9 to 20 nucleotides, from 9 to 15 nucleotides, from 9 to 10 nucleotides. 
Codons, however, preferably are 3, 4, 5, 6, 7, 8, 9 or 10 nucleotides in length. 

[0036] In one embodiment, the set of codons used in the template maximizes the number of 
mismatches between any two codons within a codon set to ensure that only the proper anti- 

20 codons of the transfer units anneal to the codon sites of the template. Furthermore, it is 

important that the template has mismatches between all the members of one codon set and all the 
codons of a different codon set to ensure that the anti-codons do not inadvertently bind to the 
wrong codon set For example, with regard to the choice of codons n bases in length, each of the 
codons within a particular codon set should differ with one another by k mismatches, and all of 

25 the codons in one codon set should differ by m mismatches with all of the codons in the other 
codon set. Exemplary values for n. A, and w, for a variety of codon sets suitable for use on a 
template are published, for example, in Table 1 of U.S. Patent Application Publication No. US- 
2004/0180412, by Liu et al 

[0037] Using an appropriate algorithm, it is possible to generate sets of codons that 
30 maximize mismatches between any two codons within the same set, where the codons are n 
bases long having at least k mismatches between any two codons. Since between any two 
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codons, there must be at least k mismatches, any two subcodons of w — (A:— 1) bases must have at 
least one mismatch. This sets an upper limit of 4"-*^' on the size of any («, 1C) codon set. Such an 
algorithm preferably starts with the 4""^^ possible subcodons of length w - 1) and then tests 
all combinations of adding A: - 1 bases for those that always maintain k mismatches. All possible 
5 (w, K) sets can be generated for n < 6. For « > 6, the 4""^^ upper limits of codons cannot be met 
and a "full" packing of viable codons is mathematically impossible. In addition to there being at 
least one mismatch k between codons within the same codon set, there should also be at least one 
mismatch m between all the codons of one codon set and all the codons of another codon set. 
Using this approach, different sets of codons can be generated so that no codons are repeated. 

10 [0038] By way of example, four (n=5^ Af=3, m==l) sets, each with 64 codons, can be chosen 
that always have at least one mismatch between any two codons in different sets and at least 
three mismatches between codons in the same set, as described, for example, in Tables 2-5 of 
U.S. Patent Application Publication No. US-2004/0180412, by Liu et al. Similarly, four (/?=6, 
Ar=4, m=2) sets, each with 64 codons, can be chosen that always have at least two mismatches 

15 between any two codons in different codon sets and at least four mismatches between codons in 
the same codon set as described, for example, in Tables 6-9 of U.S. Patent Application 
Publication No. US-2004/01 80412, by Liu et al 

[0039] Codons can also be chosen to increase control over the GC content and, therefore, the 
melting temperature of the codon and anti-codon. Codons sets with a wide range in GC content 

20 versus AT content may result in reagents that anneal with different efficiencies due to different 
melting temperatures. By screening for GC content among different («, A) sets, the GC content 
for the codon sets can be optimized. For example, the four (6, 4, 2) codon sets set forth in Tables 
6-9 of U.S. Patent Application Publication No. US-2004-0180412-A1 each contain 40 codons 
with identical GC content {i.e., 50% GC content). By using only these 40 codons at each 

25 position, all the reagents in theory will have comparable melting temperatures, removing 

potential biases in annealing that might otherwise affect library synthesis. Longer codons that 
maintain a large number of mismatches such as those appropriate for certain applications such as 
the reaction discovery system can also be chosen using this approach. For example, by 
combining two (6, 4) sets together while matching low GC to high GC codons, (12, 8) sets with 

30 64 codons all with 50% GC content can be generated for use in reaction discovery selections as 
well as other application where multiple mismatches might be advantageous. These codons 
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satisfy the requirements for encoding a 30 x 30 matrix of functional group combinations for 
reaction discovery. 

[0040] Although an anti-codon is intended to bind only to a codon, an anti-codon may also 
bind to an unintended sequence on a template if complementary sequence is present. Thus, an 
5 anti-codon may inadvertently bind to a non-codon sequence. Alternatively, an anti-codon might 
inadvertently bind out-of-frame by annealing in part to one codon and in part to another codon or 
to a non-codon sequence. Finally, an anti-codon might bind in-frame to an incorrect codon, an 
issue addressed by the codon sets described above by requiring at least one base difference 
distinguishing each codon. In Nature, the problems of noncoding sequences and out-of-frame 

10 binding are avoided by the ribosome. The nucleic acid-templated methods described herein, 
however, do not take advantage of the ribosome's fidelity. Therefore, in order to avoid 
erroneous annealing, the templates can be designed such that sequences complementary to anti- 
codons are found exclusively at in-frame codon positions. For example, codons can be designed 
to begin, or end, with a particular base (e.g., "G"). If that base is omitted from all other positions 

15 in the template {le., all other positions are restricted to T, C, and A), only perfect codon 

sequences in the template will be at the in-frame codon sequences. Similarly, the codon may be 
designed to be sufficiently long such that its sequence is unique and does not appear elsewhere in 
a template. 

[0041] When the nucleic acid-templated synthesis is used to produce a polymer, spacer 
20 sequences may also be placed between the codons to prevent frame shifting. More preferably, 
the bases of the template that encode each polymer subunit (the "genetic code" for the polymer) 
may be chosen from Table 1 to preclude or minimize the possibility of out-of-frame annealing. 
These genetic codes reduce undesired frameshifled nucleic acid-templated polymer translation 
and differ in the range of expected melting temperatures and in the minimum number of 
25 mismatches that result during out-of-frame annealing. 
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TABLE 1: Representative Genetic Codes for Nucleic Acid-templated 
Polymers That Preclude Out-Of-Frame Annealing. 



SequenQe 


• IS^umber of Pos^^^^ 


VVNT 


36 possible codons 


NVVT 


36 possible codons 


SSWT 


8 possible codons 


SSST 


8 possible codons 


SSNT 


16 possible codons 


VNVNT or NVNVT 


144 possible codons 


SSSWT or SSWST 


16 possible codons 


SNSNT or NSNST 


64 possible codons 


SSNWT or SWNST 


32 possible codons 


WSNST or NSWST 


32 possible codons 



where, V = A, C, or G, S= C or G, W = A or T, and N = A, C, G, or T 

5 [0042] As in Nature, start and stop codons are useful, particularly in the context of polymer 
synthesis, to restrict erroneous anti-codon annealing to non-codons and to prevent excessive 
extension of a growing polymer. For example, a start codon can anneal to a transfer unit bearing 
a small molecule scaffold or a start monomer unit for use in polymer synthesis; the start 
monomer unit can be masked by a photolabile protecting group. A stop codon, if used to 

10 terminate polymer synthesis, should not conflict with any other codons used in the synthesis and 
should be of the same general format as the other codons. Generally, a stop codon can encode a 
monomer unit that terminates polymerization by not providing a reactive group for further 
attachment. For example, a stop monomer unit may contain a blocked reactive group such as an 
acetamide rather than a primary amine. In other embodiments, the stop monomer unit can 

15 include a biotinylated terminus that terminates the polymerization and facilitates purification of 
the resulting polymer. 
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[0043] An exemplary approach for minimizing out-of-firame annealing during polymer 
synthesis is described in Figure 8. 

(Hi) Template Synthesis 

[0044] The templates may be synthesized using methodologies well known in the art. For 
5 example, the nucleic acid sequence may be prepared using any method known in the art to 
prepare nucleic acid sequences. These methods include both in vivo and in vitro methods 
including PGR, plasmid preparation, endonuclease digestion, solid phase synthesis (for example, 
using an automated synthesizer), in vitro transcription, strand separation, etc. Following 
synthesis, the template, when desired may be associated (for example, covalently or non 
10 covalently coupled) with a reactive unit of interest using standard coupling chemistries known in 
the art. 

[0045] An efficient method to synthesize a large variety of templates is to use a "split-pool" 
technique. The oligonucleotides are synthesized using standard 3' to 5' chemistries. First, the 
constant 3' end is synthesized. This is then split into n different vessels, where n is the number 

15 of different codons to appear at that position in the template. For each vessel, one of the n 

different codons is synthesized on the (growing) 5' end of the constant 3' end. Thus, each vessel 
contains, from 5' to 3', a different codon attached to a constant 3' end. The n vessels are then 
pooled, so that a single vessel contains n different codons attached to the constant 3' end. Any 
constant bases adjacent the 5' end of the codon are now synthesized. The pool then is split into 

20 m different vessels, where m is the number of different codons to appear at the next (more 5') 
position of the template. A different codon is synthesized (at the 5' end of the growing 
oligonucleotide) in each of the m vessels. The resulting oligonucleotides are pooled in a single 
vessel. Splitting, synthesizing, and pooling are repeated as required to synthesize all codons and 
constant regions in the oligonucleotides. 

25 11. TRANSFER UNITS 

[0046] A transfer unit comprises an oligonucleotide containing an anti-codon sequence and a 
reactive unit. The anti-codons are designed to be complementary to the codons present in the 
template. Accordingly, the sequences used in the template and the codon lengths should be 
considered when designing the anti-codons. Any molecule complementary to a codon used in 
30 the template may be used, including natural or non-natural nucleotides. In certain embodiments, 
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the codons include one or more bases found in nature (/.e., thymidine, uracil, guanidine, 
cytosine, and adenine). Thus, the anti-codon can include one or more nucleotides normally 
found in Nature with a base, a sugar, and an optional phosphate group. Alternatively, the bases 
may be connected via a backbone other than the sugar-phosphate backbone normally found in 
5 Nature (e.g., non-natural nucleotides). 

[0047] As discussed above, the anti-codon is associated with a particular type of reactive unit 
to form a transfer unit. The reactive unit may represent a distinct entity or may be part of the 
functionality of the anti-codon unit. In certain embodiments, each anti-codon sequence is 
associated with one monomer type. For example, the anti-codon sequence ATTAG may be 

10 associated with a carbamate residue with an isobutyl side chain, and the anti-codon sequence 

CATAG may be associated with a carbamate residue with a phenyl side chain. This one-for-one 
mapping of anti-codon to monomer units allows the decoding of any polymer of the library by 
sequencing the nucleic acid template used in the synthesis and allows synthesis of the same 
polymer or a related polymer by knowing the sequence of the original polymer. By changing 

15 (e.g., mutating) the sequence of the template, different monomer units may be introduced, 
thereby allowing the synthesis of related polymers, which can subsequently be selected and 
evolved. In certain preferred embodiments, several anti-codons may code for one monomer unit 
as is the case in Nature. 

[0048] Additionally, the association between the anti-codon and the reactive unit, for 
20 example, a monomer unit, in the transfer unit may be covalent or non-covalent. The association 
maybe through a covalent bond and, in certain embodiments, the covalent bond may be 
severable. 

[0049] Thus, the anti-codon can be associated with the reactant through a linker moiety. The 
linkage can be cleavable by light, oxidation, hydrolysis, exposure to acid, exposure to base, 

25 reduction, etc, Fruchtel et al (1996) Angew. Chem. Int. Ed. Engl, 35: 17 describes a variety of 
linkages useful in the practice of the invention. The linker facilitates contact of the reactant with 
the small molecule scaffold and in certain embodiments, depending on the desired reaction, 
positions DNA as a leaving group ("autocleavable" strategy), or may link reactive groups to the 
template via the "scarless" linker strategy (which yields product without leaving behind an 

30 additional atom or atoms having chemical functionality), or a "useful scar" strategy (in which a 
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portion of the linker is left behind to be fiinctionalized in subsequent steps following linker 
cleavage). 

[0050] With the "autocleavable" linker strategy, the DNA-reactive group bond is cleaved as 
a natural consequence of the reaction. In the "scarless" linker strategy, DNA-templated reaction 
5 of one reactive group is followed by cleavage of the linker attached through a second reactive 
group to yield products without leaving behind additional atoms capable of providing chemical 
functionality. Alternatively, a "useful scar" may be utilized on the theory that it may be 
advantageous to introduce useful atoms and/or chemical groups as a consequence of linker 
cleavage. In particular, a "useful scar" is left behind following linker cleavage and can be 
10 functionalized in subsequent steps. 

[0051] The anti-codon and the reactive unit (monomer unit) may also be associated through 
non-covalent interactions such as ionic, electrostatic, hydrogen bonding, van der Waals 
interactions, hydrophobic interactions, pi-stacking, etc, and combinations thereof To give but 
one example, an anti-codon may be linked to biotin, and a monomer unit linked to streptavidin. 
15 The propensity of streptavidin to bind biotin leads to the non-covalent association between the 
anti-codon and the monomer unit to form the transfer unit. 

[0052] The specific annealing of transfer units to templates permits the use of transfer units 
at concentrations lower than concentrations used in many traditional organic syntheses. Thus, 
transfer units can be used at submillimolar concentrations (e.g. less than 100 |liM, less than 10 
20 iLiM, less than 1 |jM, less than 100 nM, or less than 10 nM). 

m. CHEMICAL REACTIONS 

[0053] A variety of compounds and/or libraries can be prepared using the methods described 
herein. In certain embodiments, compounds that are not, or do not resemble, nucleic acids or 
analogs thereof, are synthesized according to the method of the invention. 

25 (i) Coupling Reactions for Polymer Synthesis 

[0054] In certain embodiments, polymers, specifically unnatural polymers, are prepared 
according to the method of the present invention. The unnatural polymers that can be created 
using the inventive method and system include any unnatural polymers. Exemplary unnatural 
polymers include, but are not limited to, peptide nucleic acid (PNA) polymers, polycarbamates, 
30 polyureas, polyesters, polyacrylate, polyalkylene (e.g-., polyethylene, polypropylene). 
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polycarbonates, polypeptides with unnatural stereochemistry, polypeptides with unnatural amino 
acids, and combination thereof. In certain embodiments, the polymers comprise at least 10, 25, 
75, 100, 125, 150 monomer units or more. The polymers synthesized using the inventive system 
may be used, for example, as catalysts, pharmaceuticals, metal chelators, or catalysts. 

5 [0055] In preparing certain unnatural polymers, the monomer units attached to the anti- 

codons may be any monomers or oligomers capable of being joined together to form a polymer. 
The monomer units may be, for example, carbamates, D-amino acids, unnatural amino acids, 
PNAs, ureas, hydroxy acids, esters, carbonates, acrj^lates, or ethers. In certain embodiments, the 
monomer units have two reactive groups used to link the monomer unit into the growing 

10 polymer chain, as depicted in Figure 2. Preferably, the two reactive groups are not the same so 
that the monomer unit may be incorporated into the polymer in a directional sense, for example, 
at one end may be an electrophile and at the other end a nucleophile. Reactive groups may 
include, but are not limited to, esters, amides, carboxylic acids, activated carbonyl groups, acid 
chlorides, amines, hydroxyl groups, and thiols. In certain embodiments, the reactive groups are 

15 masked or protected (Greene et aL (1999) PROTECTIVE Groups in Organic Synthesis 3rd 

Edition, Wiley) so that polymerization may not take place until a desired time when the reactive 
groups are deprotected. Once the monomer units are assembled along the nucleic acid template, 
initiation of the polymerization sequence results in a cascade of polymerization and deprotection 
steps wherein the polymerization step results in deprotection of a reactive group to be used in the 

20 subsequent polymerization step. 

[0056] The monomer units to be polymerized can include two or more monomers depending 
on the geometry along the nucleic acid template. The monomer units to be polymerized must be 
able to stretch along the nucleic acid template and particularly across the distance spanned by its 
encoding anti-codon and optional spacer sequence. In certain embodiments, the monomer unit 
25 actually comprises two monomers, for example, a dicarbamate, a diurea, or a dipeptide. In yet 
other embodiments, the monomer unit comprises three or more monomers. 

[0057] The monomer units may contain any chemical groups known in the art. Reactive 
chemical groups especially those that would interfere with polymerization, hybridization, etc. , 
are preferably masked using Icnown protecting groups (Greene et aL (1999) supra). In general, 
30 the protecting groups used to mask these reactive groups are orthogonal to those used in 
protecting the groups used in the polymerization steps. 
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[0058] It has been discovered that, under certain circumstances, the type of chemical reaction 
may affect the fidelity of the polymerization process. For example, distance independent 
chemical reactions (for example, reactions that occur efficiently when the reactive units are 
spaced apart by intervening bases, for example, amine acylation reactions) may result in the 
5 spurious incorporation of the wrong monomers at a particular position of a polymer chain. In 
contrast, by choosing chemical reactions for template mediated syntheses that are distance 
dependent (for example, reactions that become inefficient the further the reactive units are 
spaced part via intervening bases, for example, reductive amination reactions), it is possible 
control the fidelity of the polymerization process. 

10 [0059] Exemplary coupling chemistries for DNA-templated polymerization reactions are 
presented in Figure 9. Exemplary chemistries include, for example, olefin metathesis, amine 
acylations, wittig olefinations and reductive aminations. 

(iv) Reaction Conditions 

[0060] Nucleic acid-templated reactions can occur in aqueous or non-aqueous (z.e., organic) 
15 solutions, or a mixture of one or more aqueous and non-aqueous solutions. In aqueous solutions, 
reactions can be performed at pH ranges firom about 2 to about 12, or preferably from about 2 to 
about 10, or more preferably fi-om about 4 to about 10. The reactions used in DNA-templated 
chemistry preferably should not require very basic conditions (e.g., pH > 12, pH > 10) or very 
acidic conditions (e.g., pH < 1, pH < 2, pH < 4), because extreme conditions may lead to 
20 degradation or modification of the nucleic acid template and/or molecule (for example, the 

polymer, or small molecule) being synthesized. The aqueous solution can contain one or more 
inorganic salts, including, but not limited to, NaCl, Na2S04, KCl, Mg^^, Mn"^^, etc, at various 
concentrations. 

[0061] Organic solvents suitable for nucleic acid-templated reactions include, but are not 
25 limited to, methylene chloride, chloroform, dimethylformamide, and organic alcohols, including 
methanol and ethanol. To permit quantitative dissolution of reaction components in organic 
solvents, quaternized ammonium salts, such as, for example, long chain tetraalkylammonium 
salts, can be added (Jost et al (1989) NUCLEIC AciDS RES. 17: 2143; Mel'nikov et al. (1999) 
Langmuir 15: 1923-1928). 
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[0062] Nucleic acid-templated reactions may require a catalyst, such as, for example, 
homogeneous, heterogeneous, phase transfer, and asymmetric catalysis. In other embodiments, a 
catalyst is not required. The presence of additional, accessory reagents not linked to a nucleic 
acid are preferred in some embodiments. Useful accessory reagents can include, for example, 
5 oxidizing agents (e.g., NaI04); reducing agents (e,g., NaCNBHa); activatmg reagents (e.g., EDC, 
NHS, and sulfo-NHS); transition metals such as nickel (e.g., Ni(N03)2), rhodium (e,g. RhCls), 
ruthenium (e.g, RuCls), copper (e.g. Cu(N03)2), cobalt (e.g. C0CI2), iron (e.g. Fe(N03)3), 
osmium (e.g. OSO4), titanium (e.g. TiCU or titanium tetraisopropoxide), palladium (e.g. 
NaPdCU), or Ln; transition metal ligands (e.g., phosphines, amines, and halides); Lewis acids; 
10 and Lewis bases. 

[0063] Reaction conditions preferably are optimized to suit the nature of the reactive units 
and oligonucleotides used. 

(v) Classes of Chemical Reactions 

[0064] Known chemical reactions for synthesizing polymers can be used in nucleic acid- 
1 5 templated reactions. Thus, reactions such as those listed in March *s Advanced Organic 

Chemistry, Organic Reactions, Organic Syntheses, organic text books, journals such as Journal 
of the American Chemical Society, Journal of Organic Chemistry, Tetrahedron, etc., and 
Carruther's Some Modem Methods of Organic Chemistry can be used. The chosen reactions 
preferably are compatible with nucleic acids such as DNA or RNA or are compatible with the 
20 modified nucleic acids used as the template. 

[0065] Reactions useful in nucleic-acid templated chemistry include, for example, 
substitution reactions, carbon-carbon bond forming reactions, elimination reactions, acylation 
reactions, and addition reactions. An illustrative but not exhaustive list of aliphatic nucleophilic 
substitution reactions useful in the present invention includes, for example, 8^2 reactions, SnI 
25 reactions, SnI reactions, allylic rearrangements, nucleophilic substitution at an aliphatic trigonal 
carbon, and nucleophilic substation at a vinylic carbon. 

[0066] Specific aliphatic nucleophilic substitution reactions with oxygen nucleophiles 
include, for example, hydrolysis of alkyl halides, hydrolysis of gen-dihalides, hydrolysis of 
1,1,1-trihalides, hydrolysis of alkyl esters or inorganic acids, hydrolysis of diazo ketones, 
30 hydrolysis of acetal and enol ethers, hydrolysis of epoxides, hydrolysis of acyl halides. 
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hydrolysis of anhydrides, hydrolysis of carboxylic esters, hydrolysis of amides, alkylation with 
alky! halides (Williamson Reaction), epoxide formation, alkylation with inorganic esters, 
alkylation with diazo compounds, dehydration of alcohols, transetherification, alcoholysis of 
epoxides, alkylation with onium salts, hydroxylation of silanes, alcoholysis of acyl halides, 
5 alcoholysis of anhydrides, esterfication of carboxylic acids, alcoholysis of carboxylic esters 
(transesterfication), alcoholysis of amides, alkylation of carboxylic acid salts, cleavage of ether 
with acetic anhydride, alkylation of carboxylic acids with diazo compounds, acylation of 
caroxylic acids with acyl halides, acylation of carboxylic acids with carboxylic acids, formation 
of oxonium salts, preparation of peroxides and hydroperoxides, preparation of inorganic esters 
10 (e.g., nitrites, nitrates, sulfonates), preparation of alcohols from amines, and preparation of mixed 
organic-inorganic anhydrides. 

[0067] Specific aliphatic nucleophilic substitution reactions with sulfur nucleophiles, which 
tend to be better nucleophiles than their oxygen analogs, include, for example, attack by SH at an 
alkyl carbon to form thiols, attack by S at an alkyl carbon to form thioethers, attack by SH or SR 
15 at an acyl carbon, formation of disulfides, formation of Bunte salts, alkylation of sulfinic acid 
salts, and formation of alkyl thiocyanates. 

[0068] Aliphatic nucleophilic substitution reactions with nitrogen nucleophiles include, for 
example, alkylation of amines, JV-arylation of amines, replacement of a hydroxy by an amino 
group, transamination, transamidation, alkylation of amines with diazo compounds, amination of 

20 epoxides, amination of oxetanes, amination of aziridines, amination of alkanes, formation of 

isocyanides, acylation of amines by acyl halides, acylation of amines by anhydrides, acylation of 
amines by carboxylic acids, acylation of amines by carboxylic esters, acylation of amines by 
amides, acylation of amines by other acid derivatives, iV-alkylation or JV-arylation of amides and 
imides, iV^acylation of amides and imides, formation of aziridines from epoxides, formation of 

25 nitro compounds, formation of azides, formation of isocyanates and isothiocyanates, and 
formation of azoxy compounds. 

[0069] Aliphatic nucleophilic substitution reactions with halogen nucleophiles include, for 
example, attack at an allcyl carbon, halide exchange, formation of alkyl halides from esters of 
sulfuric and sulfonic acids, formation of alkyl halides from alcohols, formation of alkyl halides 
30 from ethers, formation of halohydrins from epoxides, cleavage of carboxylic esters with lithium 
iodide, conversion of diazo ketones to a-halo ketones, conversion of amines to halides. 
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conversion of tertiary amines to cyanamides (the von Braun reaction), formation of acyl halides 
from carboxylic acids, and formation of acyl halides from acid derivatives, 

[0070] Aliphatic nucleophilic substitution reactions using hydrogen as a nucleophile include, 
for example, reduction of alkyl halides, reduction of tosylates, other sulfonates, and similar 
5 compounds, hydrogenolysis of alcohols, hydrogenolysis of esters (Barton-McCombie reaction), 
hydrogenolysis of nitriles, replacement of alkoxyl by hydrogen, reduction of epoxides, reductive 
cleavage of carboxylic esters, reduction of a C-N bond, desulfurization, reduction of acyl halides, 
reduction of carboxylic acids, esters, and anhydrides to aldehydes, and reduction of amides to 
aldehydes. 

1 0 [0071] Although certain carbon nucleophiles may be too nucleophilic and/or basic to be used 
in certain embodiments of the invention, aliphatic nucleophilic substitution reactions using 
carbon nucleophiles include, for example, coupling with silanes, coupling of alkyl halides (the 
Wurtz reaction), the reaction of alkyl halides and sulfonate esters with Group I (I A) and II (II A) 
organometallic reagents, reaction of alkyl halides and sulfonate esters with organocuprates, 

15 reaction of alkyl halides and sulfonate esters with other organometallic reagents, ally lie and 

propargylic coupling with a halide substrate, coupling of organometallic reagents with esters of 
sulfuric and sulfonic acids, sulfoxides, and sulfones, coupling involving alcohols, coupling of 
organometallic reagents with carboxylic esters, coupling of organometallic reagents with 
compounds containing an esther linkage, reaction of organometallic reagents with epoxides, 

20 reaction of organometallics with aziridine, alkylation at a carbon bearing an active hydrogen, 
alkylation of ketones, nitriles, and carboxylic esters, alkylation of carboxylic acid salts, 
alkylation at a position a to a heteroatom (alkylation of 1,3-dithianes), alkylation of dihydro-1,3- 
oxazine (the Meyers synthesis of aldehydes, ketones, and carboxylic acids), alkylation with 
trialkylboranes, alkylation at an alk5myl carbon, preparation of nitriles, direct conversion of alkyl 

25 halides to aldehydes and ketones, conversion of alkyl halides, alcohols, or alkanes to carboxylic 
acids and their derivatives, the conversion of acyl halides to ketones with organometallic 
compounds, the conversion of anhydrides, carboxylic esters, or amides to ketones with 
organometallic compounds, the coupling of acyl halides, acylation at a carbon bearing an active 
hydrogen, acylation of carboxylic esters by carboxylic esters (the Claisen and Dieckmann 

30 condensation), acylation of ketones and nitriles with carboxylic esters, acylation of carboxylic 
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acid salts, preparation of acyl cyanides, and preparation of diazo ketones, ketonic 
decarboxylation. 

[0072] Reactions which involve nucleophilic attack at a sulfonyl sulfur atom may also be 
used in the present invention and include, for example, hydrolysis of sulfonic acid derivatives 
5 (attack by OH), formation of sulfonic esters (attack by OR), formation of sulfonamides (attack 
by nitrogen), formation of sulfonyl halides (attack by halides), reduction of sulfonyl chlorides 
(attack by hydrogen), and preparation of sulfones (attack by carbon). 

[0073] Aromatic electrophilic substitution reactions may also be used in nucleotide- 
templated chemistry. Hydrogen exchange reactions are examples of aromatic electrophilic 

10 substitution reactions that use hydrogen as the electrophile. Aromatic electrophilic substitution 
reactions which use nitrogen electrophiles include, for example, nitration and nitro-de- 
hydrogenation, nitrosation of nitroso-de-hydrogenation, diazonium coupling, direct introduction 
of the diazonium group, and amination or amino-de-hydrogenation. Reactions of this type with 
sulfur electrophiles include, for example, sulfonation, sulfo-de-hydrogenation, halosulfonation, 

15 halosulfo-de-hydrogenation, sulfurization, and sulfonylation. Reactions using halogen 
electrophiles include, for example, halogenation, and halo-de-hydrogenation. Aromatic 
electrophilic substitution reactions with carbon electrophiles include, for example, Friedel-Crafls 
alkylation, alkylation, alkyl-de-hydrogenation, Friedel-Crafts arylation (the Scholl reaction), 
Friedel-Crafls acylation, formylation with disubstituted formamides, formylation with zinc 

20 cyanide and HCl (the Gatterman reaction), formylation with chloroform (the Reimer-Tiemann 
reaction), other formylations, formyl-de-hydrogenation, carboxylation with carbonyl halides, 
carboxylation with carbon dioxide (the Kolbe-Schmitt reaction), amidation with isocyanates, iV- 
alkylcarbamoyl-de-hydrogenation, hydroxyalkylation, hydroxyalkyl-de-hydrogenation, 
cyclodehydration of aldehydes and ketones, haloalkylation, halo-de-hydrogenation, 

25 aminoalkylation, amidoalkylation, dialkylaminoalkylation, dialkylamino-de-hydrogenation, 
thioalkylation, acylation with nitriles (the Hoesch reaction), cyanation, and cyano-de- 
hydrogenation. Reactions using oxygen electrophiles include, for example, hydroxylation and 
hydroxy-de-hydrogenation. 

[0074] Rearrangement reactions include, for example, the Fries rearrangement, migration of 
30 a nitro group, migration of a nitroso group (the Fischer-Hepp Rearrangement), migration of an 
arylazo group, migration of a halogen (the Orton rearrangement), migration of an alkyl group. 
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etc. Other reaction on an aromatic ring include the reversal of a Friedel-Crafts alkylation, 
decarboxylation of aromatic aldehydes, decarboxylation of aromatic acids, the Jacobsen reaction, 
deoxygenation, desulfonation, hydro-de-sulfonation, dehalogenation, hydro-de-halogenation, and 
hydrolysis of organometallic compounds. 

5 [0075] Aliphatic electrophilic substitution reactions are also useful. Reactions using the SeI, 
Se2 (front), Se2 (back), SeI, addition-elimination, and cyclic mechanisms can be used in the 
present invention. Reactions of this type with hydrogen as the leaving group include, for 
example, hydrogen exchange (deuterio-de-hydrogenation, deuteriation), migration of a double 
bond, and keto-enol tautomerization. Reactions with halogen electrophiles include, for example, 

10 halogenation of aldehydes and ketones, halogenation of carboxylic acids and acyl halides, and 
halogenation of sulfoxides and sulfones. Reactions with nitrogen electrophiles include, for 
example, aliphatic diazonium coupling, nitrosation at a carbon bearing an active hydrogen, direct 
formation of diazo compounds, conversion of amides to a-azido amides, direct amination at an 
activated position, and insertion by nitrenes. Reactions with sulfur or selenium electrophiles 

15 include, for example, sulfenylation, sulfonation, and selenylation of ketones and carboxylic 
esters. Reactions with carbon electrophiles include, for example, acylation at an aliphatic 
carbon, conversion of aldehydes to p-keto esters or ketones, cyanation, cyano-de-hydrogenation, 
alkylation of alkanes, the Stork enamine reaction, and insertion by carbenes. Reactions with 
metal electrophiles include, for example, metalation with organometallic compounds, metalation 

20 with metals and strong bases, and conversion of enolates to silyl enol ethers. Aliphatic 
electrophilic substitution reactions with metals as leaving groups include, for example, 
replacement of metals by hydrogen, reactions between organometallic reagents and oxygen, 
reactions between organometallic reagents and peroxides, oxidation of trialkylboranes to borates, 
conversion of Grignard reagents to sulfur compounds, halo-de-metalation, the conversion of 

25 organometallic compounds to amines, the conversion of organometallic compounds to ketones, 
aldehydes, carboxylic esters and amides, cyano-de-metalation, transmetalation with a metal, 
transmetalation with a metal halide, transmetalation with an organometallic compound, reduction 
of alkyl halides, metallo-de-halogenation, replacement of a halogen by a metal from an 
organometallic compound, decarboxylation of aliphatic acids, cleavage of alkoxides, 

30 replacement of a carboxyl group by an acyl group, basic cleavage of p-keto esters and p- 

diketones, haloform reaction, cleavage of non-enolizable ketones, the Haller-Bauer reaction, 
cleavage of alkanes, decyanation, and hydro-de-cyanation. Electrophlic substitution reactions at 
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nitrogen include, for example, diazotization, conversion of hydrazines to azides, iV-nitrosation, 
iV-nitroso-de-hydrogenation, conversion of amines to azo compounds, iV-halogenation, iV-halo- 
de-hydrogenation, reactions of amines with carbon monoxide, and reactions of amines with 
carbon dioxide. 

5 [0076] Aromatic nucleophilic substitution reactions may also be used in the present 

invention. Reactions proceeding via the SnAt mechanism, the SnI mechanism, the benzyne 
mechanism, the SrnI mechanism, or other mechanism, for example, can be used. Aromatic 
nucleophilic substitution reactions with oxygen nucleophiles include, for example, hydroxy-de- 
halogenation, alkali fusion of sulfonate salts, and replacement of OR or OAr. Reactions with 

10 sulfur nucleophiles include, for example, replacement by SH or SR. Reactions using nitrogen 
nucleophiles include, for example, replacement by NH2, NHR, or NR2, and replacement of a ' 
hydroxy group by an amino group. Reactions with halogen nucleophiles include, for example, 
the introduction halogens. Aromatic nucleophilic substitution reactions with hydrogen as the 
nucleophile include, for example, reduction of phenols and phenolic esters and ethers, and 

15 reduction of halides and nitro compounds. Reactions with carbon nucleophiles include, for 

example, the Rosenmund-von Braun reaction, coupling of organometallic compounds with aryl 
halides, ethers, and carboxylic esters, arylation at a carbon containing an active hydrogen, 
conversions of aryl substrates to carboxylic acids, their derivatives, aldehydes, and ketones, and 
the Ullmann reaction. Reactions with hydrogen as the leaving group include, for example, 

20 alkylation, arylation, and amination of nitrogen heterocycles. Reactions with N2"^ as the leaving 
group include, for example, hydroxy-de-diazoniation, replacement by sulfur-containing groups, 
iodo-de-diazoniation, and the Schiemann reaction. Rearrangement reactions include, for 
example, the von Richter rearrangement, the Sommelet-Hauser rearrangement, rearrangement of 
aryl hydroxylamines, and the Smiles rearrangement. 

25 [0077] Reactions involving free radicals can also be used, although the free radical reactions 
used in nucleotide-templated chemistry should be carefully chosen to avoid modification or 
cleavage of the nucleotide template. With that limitation, free radical substitution reactions can 
be used in the present invention. Particular free radical substitution reactions include, for 
example, substitution by halogen, halogenation at an alkyl carbon, allylic halogenation, benzylic 

30 halogenation, halogenation of aldehydes, hydroxylation at an aliphatic carbon, hydroxy lation at 
an aromatic carbon, oxidation of aldehydes to carboxylic acids, formation of cyclic ethers, 
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formation of hydroperoxides, formation of peroxides, acyloxylation, acyloxy-de-hydrogenation, 
chlorosulfonation, nitration of alkanes, direct conversion of aldehydes to amides, amidation and 
amination at an alkyl carbon, simple coupling at a susceptible position, coupling of alkynes, 
arylation of aromatic compounds by diazonium salts, arylation of activated alkenes by diazonium 
5 salts (the Meerwein arylation), arylation and alkylation of alkenes by organopalladium 

compounds (the Heck reaction), arylation and alkylation of alkenes by vinyltin compounds (the 
Stille reaction), alkylation and arylation of aromatic compounds by peroxides, photochemical 
arylation of aromatic compounds, alkylation, acylation, and carbalkoxylation of nitrogen 
heterocycles Particular reactions in which N2'*" is the leaving group include, for example, 

10 replacement of the diazonium group by hydrogen, replacement of the diazonium group by 
chlorine or bromine, nitro-de-diazoniation, replacement of the diazonium group by sulfur- 
containing groups, aryl dimerization with diazonium salts, methylation of diazonium salts, 
vinylation of diazonium salts, arylation of diazonium salts, and conversion of diazonium salts to 
aldehydes, ketones, or carboxylic acids. Free radical substitution reactions with metals as 

15 leaving groups include, for example, coupling of Grignard reagents, coupling of boranes, and 
coupling of other organometallic reagents. Reaction with halogen as the leaving group are 
included. Other free radical substitution reactions with various leaving groups include, for 
example, desulfurization with Raney Nickel, conversion of sulfides to organolithium 
compounds, decarboxylative dimerization (the Kolbe reaction), the Hunsdiecker reaction, 

20 decarboxylative allylation, and decarbonylation of aldehydes and acyl halides. 

[0078] Reactions involving additions to carbon-carbon multiple bonds are also used in 
nucleotide-templated chemistry. Any mechanism may be used in the addition reaction including, 
for example, electrophilic addition, nucleophilic addition, fi-ee radical addition, and cyclic 
mechanisms. Reactions involving additions to conjugated systems can also be used. Addition to 

25 cyclopropane rings can also be utilized. Particular reactions include, for example, isomerization, 
addition of hydrogen halides, hydration of double bonds, hydration of triple bonds, addition of 
alcohols, addition of carboxylic acids, addition of H2S and thiols, addition of ammonia and 
amines, addition of amides, addition of hydrazoic acid, hydrogenation of double and triple 
bonds, other reduction of double and triple bonds, reduction of the double and triple bonds of 

30 conjugated systems, hydrogenation of aromatic rings, reductive cleavage of cyclopropanes, 

hydroboration, other hydrometalations, addition of alkanes, addition of alkenes and/or alkynes to 
alkenes and/or alkynes (e.g., pi-cation cyclization reactions, hydro-alkenyl-addition), ene 
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reactions, the Michael reaction, addition of organometallics to double and triple bonds not 
conjugated to carbonyls, the addition of two alkyl groups to an alkyne, 1,4-addition of 
organometallic compounds to activated double bonds, addition of boranes to activated double 
bonds, addition of tin and mercury hydrides to activated double bonds, acylation of activated 
5 double bonds and of triple bonds, addition of alcohols, amines, carboxylic esters, aldehydes, etc,^ 
carbonylation of double and triple bonds, hydrocarboxylation, hydroformylation, addition of 
aldehydes, addition of HCN, addition of silanes, radical addition, radical cyclization, 
halogenation of double and triple bonds (addition of halogen, halogen), halolactonization, 
halolactamization, addition of hypohalous acids and hypohalites (addition of halogen, oxygen), 

10 addition of sulfur compounds (addition of halogen, sulfur), addition of halogen and an amino 
group (addition of halogen, nitrogen), addition of NOX and NO2X (addition of halogen, 
nitrogen), addition of XN3 (addition of halogen, nitrogen), addition of alkyl halides (addition of 
halogen, carbon), addition of acyl halides (addition of halogen, carbon), hydroxy lation (addition 
of oxygen, oxygen) (e.g., asymmetric dihydroxylation reaction with OSO4), dihydroxylation of 

15 aromatic rings, epoxidation (addition of oxygen, oxygen) (e.g., Sharpless asymmetric 

epoxidation), photooxidation of dienes (addition of oxygen, oxygen), hydroxysulfenylation 
(addition of oxygen, sulfur), oxyamination (addition of oxygen, nitrogen), diamination (addition 
of nitrogen, nitrogen), formation of aziridines (addition of nitrogen), aminosulfenylation 
(addition of nitrogen, sulfur), acylacyloxylation and acylamidation (addition of oxygen, carbon 

20 or nitrogen, carbon), 1,3-dipolar addition (addition of oxygen, nitrogen, carbon), Diels-Alder 
reaction, heteroatom Diels-Alder reaction, all carbon 3 +2 cycloadditions, dimerization of 
alkenes, the addition of carbenes and carbenoids to double and triple bonds, trimerization and 
tetramerization of alkynes, and other cycloaddition reactions. 

[0079] In addition to reactions involving additions to carbon-carbon multiple bonds, addition 
25 reactions to carbon-hetero multiple bonds can be used in nucleotide-templated chemistry. 
Exemplary reactions include, for example, the addition of water to aldehydes and ketones 
(formation of hydrates), hydrolysis of carbon-nitrogen double bond, hydrolysis of aliphatic nitro 
compounds, hydrolysis of nitriles, addition of alcohols and thiols to aldehydes and ketones, 
reductive alkylation of alcohols, addition of alcohols to isocyanates, alcoholysis of nitriles, 
30 formation of xanthates, addition of H2S and thiols to carbonyl compounds, formation of bisulfite 
addition products, addition of amines to aldehydes and ketones, addition of amides to aldehydes, 
reductive alkylation of ammonia or amines, the Mannich reaction, the addition of amines to 
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isocyanates, addition of ammonia or amines to nitriles, addition of amines to carbon disulfide 
and carbon dioxide, addition of hydrazine derivative to carbonyl compounds, formation of 
oximes, conversion of aldehydes to nitriles, formation of gem-dihalides fi-om aldehydes and 
ketones, reduction of aldehydes and ketones to alcohols, reduction of the carbon-nitrogen double 
5 bond, reduction of nitriles to amines, reduction of nitriles to aldehydes, addition of Grignard 
reagents and organolithium reagents to aldehydes and ketones, addition of other organometallics 
to aldehydes and ketones, addition of trialkylallylsilanes to aldehydes and ketones, addition of 
conjugated alkenes to aldehydes (the Baylis-Hillman reaction), the Reformatsky reaction, the 
conversion of carboxylic acid salts to ketones with organometallic compounds, the addition of 

10 Grignard reagents to acid derivatives, the addition of organometallic compounds to CO2 and CS2, 
addition of organometallic compounds to C=N compounds, addition of carbenes and 
diazoalkanes to C=N compounds, addition of Grignard reagents to nitriles and isocyanates, the 
Aldol reaction, Mukaiyama Aldol and related reactions, Aldol-type reactions between carboxylic 
esters or amides and aldehydes or ketones, the Knoevenagel reaction (e.g., the Nef reaction, the 

15 Favorskii reaction), the Peterson alkenylation reaction, the addition of active hydrogen 
compounds to CO2 and CS25 the Perkin reaction, Darzens glycidic ester condensation, the 
ToUens' reaction, the Wittig reaction, the Tebbe alkenylation, the Petasis alkenylation, 
alternative alkenylations, the Thorpe reaction, the Thorpe-Ziegler reaction, addition of silanes, 
formation of cyanohydrins, addition of HCN to C=N and C=N bonds, the Prins reaction, the 

20 benzoin condensation, addition of radicals to C=0, C=S, C=N compounds, the Ritter reaction, 
acylation of aldehydes and ketones, addition of aldehydes to aldehydes, the addition of 
isocyanates to isocyanates (formation of carbodiimides), the conversion of carboxylic acid salts 
to nitriles, the formation of epoxides from aldehydes and ketones, the formation of episulfides 
and episulfones, the formation of p-lactones and oxetanes (e.g,, the Paterno-Biichi reaction), the 

25 formation of P-lactams, etc. Reactions involving addition to isocyanides include the addition of 
water to isocyanides, the Passerini reaction, the Ug reaction, and the formation of metalated 
aldimines. 

[0080] Elimination reactions, including a, p, and y eliminations, as well as extrusion 
reactions, can be performed using nucleotide-templated chemistry, although the strength of the 
30 reagents and conditions employed should be considered. Preferred elimination reactions include 
reactions that go by El, E2, ElcB, or E2C mechanisms. Exemplary reactions include, for 
example, reactions in which hydrogen is removed from one side (e.g., dehydration of alcohols. 
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cleavage of ethers to alkenes, the Chugaev reaction, ester decomposition, cleavage of quartemary 
ammonium hydroxides, cleavage of quaternary ammonium salts with strong bases, cleavage of 
amine oxides, pyrolysis of keto-ylids, decomposition of toluene-p-solfonylhydrazones, cleavage 
of sulfoxides, cleavage of selenoxides, cleavage of sulfomes, dehydrogalogenation of alkyl 
5 halides, dehydrohalogeriation of acyl halides, dehydrohalogenation of sulfonyl halides, 
elimination of boranes, conversion of alkenes to alkynes, decarbonylation of acyl halides), 
reactions in which neither leaving atom is hydrogen (e.g., deoxygenation of vicinal diols, 
cleavage of cyclic thionocarbonates, conversion of epoxides to episulfides and alkenes, the 
Ramberg-BScklund reaction, conversion of aziridines to alkenes, dehalogenation of vicinal 

10 dihalides, dehalogenation of a-halo acyl halides, and elimination of a halogen and a hetero 

group), fragmentation reactions (i.e., reactions in which carbon is the positive leaving group or 
the electrofuge, such as, for example, fragmentation of y-amino and y-hydroxy halides, 
fragmentation of 1,3-diols, decarboxylation of p-hydroxy carboxylic acids, decarboxylation of p- 
lactones, fragmentation of a,p-epoxy hydrazones, elimination of CO from briged bicyclic 

15 compounds, and elimination of CO2 from bridged bicyclic compounds), reactions in which C=N 
or C=N bonds are formed (e.g., dehydration of aldoximes or similar compounds, conversion of 
ketoximes to nitriles, dehydration of unsubstituted amides, and conversion of N-alkylformamides 
to isocyanides), reactions in which C=0 bonds are formed (e.g., pyrolysis of P-hydroxy alkenes), 
and reactions in which N=N bonds are formed (e.g., eliminations to give diazoalkenes). 

20 Extrusion reactions include, for example, extrusion of N2 from pyrazolines, extrusion of N2 from 
pyrazoles, extrusion of N2 from triazolines, extrusion of CO, extrusion of CO2, extrusion of SO2, 
the Story synthesis, and alkene synthesis by twofold extrusion. 

[0081] Rearrangements, including, for example, nucleophilic rearrangements, electrophilic 
rearrangements, prototropic rearrangements, and free-radical rearrangements, can also be 

25 performed using nucleotide-templated chemistry. Both 1,2 rearrangements and non-1,2 

rearrangements can be performed. Exemplary reactions include, for example, carbon-to-carbon 
migrations of R, H, and Ar (e.g, Wagner-Meerwein and related reactions, the Pinacol 
rearrangement, ring expansion reactions, ring contraction reactions, acid-catalyzed 
rearrangements of aldehydes and ketones, the dienone-phenol rearrangement, the Favorskii 

30 rearrangement, the Amdt-Eistert synthesis, homologation of aldehydes, and homologation of 
ketones), carbon-to-carbon migrations of other groups (e.g., migrations of halogen, hydroxyl, 



wo 2006/135654 



PCT/US2006/022207 



-29- 

amino, etc.; migration of boron; and the Neber rearrangement), carbon-to-nitrogen migrations of 
R and Ar (e.g., the Hofmann rearrangement, the Curtius rearrangement, the Lossen 
rearrangement, the Schmidt reaction, the Beckman rearrangement, the Stieglits rearrangement, 
and related rearrangements), carbon-to-oxygen migrations of R and Ar (e.g., the Baeyer-Villiger 

5 rearrangement and rearrangment of hydroperoxides), nitrogen-to-carbon, oxygen-to-carbon, and 
sulfur-to-carbon migration (e.g., the Stevens rearrangement, and the Wittig rearrangement), 
boron-to-carbon migrations (e.g., conversion of boranes to alcohols (primary or otherwise), 
conversion of boranes to aldehydes, conversion of boranes to carboxylic acids, conversion of 
vinylic boranes to alkenes, formation of alkynes from boranes and acetylides, formation of 

10 alkenes from boranes and acetylides, and formation of ketones from boranes and acetylides), 
electrocyclic rearrangements (e.g., of cyclobutenes and 1,3-cyclohexadienes, or conversion of 
stilbenes to phenanthrenes), sigmatropic rearrangements (e.g., (1 J) sigmatropic migrations of 
hydrogen, (1 j) sigmatropic migrations of carbon, conversion of vinylcyclopropanes to 
cyclopentenes, the Cope rearrangement, the Claisen rearrangement, the Fischer indole synthesis, 

15 (2,3) sigmatropic rearrangements, and the benzidine rearrangement), other cyclic rearrangements 
(e.g., metathesis of alkenes, the di-TC-methane and related rearrangements, and the Hofinann- 
Loffler and related reactions), and non-cyclic rearrangements (e.g., hydride shifts, the Chapman 
rearrangement, the Wallach rearrangement, and dyotropic rearrangements). 

[0082] Oxidative and reductive reactions may also be performed using nucleotide-templated 
20 chemistry. Exemplary reactions may involve, for example, direct electron transfer, hydride 

transfer, hydrogen-atom transfer, formation of ester intermediates, displacement mechanisms, or 
addition-elimination mechanisms. Exemplary oxidations include, for example, eliminations of 
hydrogen (e.g., aromatization of six-membered rings, dehydrogenations yielding carbon-carbon 
double bonds, oxidation or dehydrogenation of alcohols to aldehydes and ketones, oxidation of 
25 phenols and aromatic amines to quinones, oxidative cleavage of ketones, oxidative cleavage of 
aldehydes, oxidative cleavage of alcohols, ozono lysis, oxidative cleavage of double bonds and 
aromatic rings, oxidation of aromatic side chains, oxidative decarboxylation, and 
bisdecarboxylation), reactions involving replacement of hydrogen by oxygen (e.g., oxidation of 
methylene to carbonyl, oxidation of methylene to OH, CO2R, or OR, oxidation of arylmethanes, 
30 oxidation of ethers to carboxylic esters and related reactions, oxidation of aromatic hydrocarbons 
to quinones, oxidation of amines or nitro compounds to aldehydes, ketones, or dihalides, 
oxidation of primary alcohols to carboxylic acids or carboxylic esters, oxidation of alkenes to 
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aldehydes or ketones, oxidation of amines to nitroso compounds and hydroxylamines, oxidation 
of primary amines, oximes, azides, isocyanates, or notroso compounds, to nitro compounds, 
oxidation of thiols and other sulfur compounds to sulfonic acids), reactions in which oxygen is 
added to the subtrate (e.g,, oxidation of alkynes to a-diketones, oxidation of tertiary amines to 
5 amine oxides, oxidation of thioesters to sulfoxides and sulfones, and oxidation of carboxylic 
acids to peroxy acids), and oxidative coupling reactions (e,g,, coupling involving carbanoins, 
dimerization of silyl enol ethers or of lithium enolates, and oxidation of thiols to disulfides). 

[0083] Exemplary reductive reactions include, for example, reactions involving replacement 
of oxygen by hydrogen (e.g., reduction of carbonyl to methylene in aldehydes and ketones, 
10 reduction of carboxylic acids to alcohols, reduction of amides to amines, reduction of carboxylic 
esters to ethers, reduction of cyclic anhydrides to lactones and acid derivatives to alcohols, 
reduction of carboxylic esters to alcohols, reduction of carboxylic acids and esters to alkanes, 
complete reduction of epoxides, reduction of nitro compounds to amines, reduction of nitro 
compounds to hydroxylamines, reduction of nitroso compounds and hydroxylamines to amines, 

15 reduction of oximes to primary amines or aziridines, reduction of azides to primary amines, 

reduction of nitrogen compounds, and reduction of sulfonyl halides and sulfonic acids to thiols), 
removal of oxygen from the substrate (e.g., reduction of amine oxides and azoxy compounds, 
reduction of sulfoxides and sulfones, reduction of hydroperoxides and peroxides, and reduction 
of aliphatic nitro compounds to oximes or nitriles), reductions that include cleavage (e.g., de- 

20 alkylation of amines and amides, reduction of azo, azoxy, and hydrazo compounds to amines, 
and reduction of disulfides to thiols), reductive couplic reactions (e.g., bimolecular reduction of 
aldehydes and ketones to 1,2-diols, bimolecular reduction of aldehydes or ketones to alkenes, 
acyloin ester condensation, reduction of nitro to azoxy compounds, and reduction of nitro to azo 
compounds), and reductions in which an organic substrate is both oxidized and reduced (e.g., the 

25 Cannizzaro reaction, the Tishchenlco reaction, the Pummerer rearrangement, and the Willgerodt 
reaction). 

IV. SELECTION AND SCREEJVING 

[0084] Selection and/or screening for reaction products with desired activities (such as 
catalytic activity, binding affinity, or a particular effect in an activity assay) may be performed 
30 using methodologies known and used in the art. For example, affinity selections may be 

performed according to the principles used in library-based selection methods such as phage 
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display, polysome display, and mRNA-fusion protein displayed peptides. Selection for catalytic 
activity may be performed by affinity selections on transition-state analog affinity columns 
(Baca et ah (1997) Proc. Natl. Acad. Scl USA 94(19): 10063-8) or by function-based 
selection schemes (Pedersen et aL (1998) Proc. Natl. Acad. Sci. USA 95(18): 10523-8). Since 
5 minute quantities of DNA (-10"^° mol) can be amplilBed by PGR (Kramer et al (1999) Current 
Protocols in Molecular Biology (ed. Ausubel, F. M.) 15.1-15.3, Wiley), these selections 
can be conducted on a scale ten or more orders of magnitude less than that required for reaction 
analysis by current methods, making a truly broad search both economical and efficient. 

(i) Selection for Binding to Target Molecule 

10 [0085] The templates and reaction products can be selected (or screened) for binding to a 
target molecule. In this context, selection or partitioning means any process whereby a library 
member bound to a target molecule is separated from library members not bound to target 
molecules. Selection can be accomplished by various methods known in the art. 

[0086] The templates of the present invention contain a built-in function for direct selection 
15 and amplification. In most applications, binding to a target molecule preferably is selective, such 
that the template and the resulting reaction product bind preferentially with a specific target 
molecule, perhaps preventing or inducing a specific biological effect. Ultimately, a binding 
molecule identified using the present invention may be usefijl as a therapeutic and/or diagnostic 
agent. Once the selection is complete, the selected templates optionally can be amplified and 
20 sequenced. The selected reaction products, if present in sufficient quantity, can be separated 

from the templates, purified (e.g., by HPLC, column chromatography, or other chromatographic 
method), and further characterized. 

(ii) Target Molecules 

[0087] Binding assays provide a rapid means for isolating and identifying reaction products 
25 that bind to, for example, a surface (such as metal, plastic, composite, glass, ceramics, rubber, 
skin, or tissue); a polymer; a catalyst; or a target biomolecule such as a nucleic acid, a protein 
(including enzymes, receptors, antibodies, and glycoproteins), a signal molecule (such as cAMP, 
inositol triphosphate, peptides, or prostaglandins), a carbohydrate, or a lipid. Binding assays can 
be advantageously combined with activity assays for the effect of a reaction product on a 
30 function of a target molecule. 
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[0088] The selection strategy can be carried out to allow selection against almost any target. 
Importantly, the selection strategy does not require any detailed structural information about the 
target molecule or about the molecules in the libraries. The entire process is driven by the 
binding affinity involved in the specific recognition and binding of the molecules in the library to 
5 a given target Examples of various selection procedures are described below. 

[0089] The libraries of the present invention can contain molecules that could potentially 
bind to any known or unknown target. The binding region of a target molecule could include a 
catalytic site of an enzyme, a binding pocket on a receptor (for example, a G-protein coupled 
receptor), a protein surface area involved in a protein-protein or protein-nucleic acid interaction 
10 (preferably a hot-spot region), or a specific site on DNA (such as the major groove). The natural 
function of the target could be stimulated (agonized), reduced (antagonized), unaffected, or 
completely changed by the binding of the reaction product. This will depend on the precise 
binding mode and the particular binding site the reaction product occupies on the target. 

[0090] Functional sites (such as protein-protein interaction or catalytic sites) on proteins 
15 often are more prone to bind molecules than are other more neutral surface areas on a protein. In 
addition, these functional sites normally contain a smaller region that seems to be primarily 
responsible for the binding energy: the so-called "hot-spot regions" (Wells, et al. (1993) RECENT 
Prog. Hormone Res. 48: 253- 262). This phenomenon facilitates selection for molecules 
affecting the biological function of a certain target. 

20 [0091] The linkage between the template molecule and reaction product allows rapid 

identification of binding molecules using various selection strategies. This invention broadly 
permits identifying binding molecules for any known target molecule. In addition, novel 
unlcnown targets can be discovered by isolating binding molecules against unknown antigens 
(epitopes) and using these binding molecules for identification and validation. In another 

25 preferred embodiment, the target molecule is designed to mimic a transition state of a chemical 
reaction; one or more reaction products resulting from the selection may stabilize the transition 
state and catalyze the chemical reaction. 

(Hi) Binding Assays 

[0092] The template-directed synthesis of the invention permits selection procedures 
30 analogous to other display methods such as phage display (Smith (1985) Science 228: 1315- 
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1317). Phage display selection has been used successfully on peptides (Wells et ah (1992) 
CURR. Op, Struct. Biol. 2: 597-604), proteins (Marks et al (1992) J. BiOL. Chem. 267: 16007- 
16010) and antibodies (Winter et al (1994) Annu. Rev. Immunol. 12: 433-455). Similar 
selection procedures also are exploited for other types of display systems such as ribosome 
5 display Mattheakis et al (1 994) Proc. Natl, Acad. Scl 91 : 9022-9026) and mRNA display 
(Roberts, etal (1997) pROC. Natl^ AcAD. Scl 94:12297-302), The libraries of the present 
invention, however, allow direct selection of target-specific molecules without requiring 
traditional ribosome-mediated translation. The present invention also allows the display of small 
molecules which have not previously been synthesized directly from a nucleic acid template. 

10 [0093] Selection of binding molecules from a library can be performed in any format to 
identify optimal binding molecules. Binding selections typically involve immobilizing the 
desired target molecule, adding a library of potential binders, and removing non-binders by 
washing. When the molecules showing low affinity for an immobilized target are washed away, 
the molecules with a stronger affinity generally remain attached to the target. The enriched 

1 5 population remaining bound to the target after stringent washing is preferably eluted with, for 
example, acid, chaotropic salts, heat, competitive elution with a known ligand or by proteolytic 
release of the target and/or of template molecules. The eluted templates are suitable for PGR, 
leading to many orders of amplification, whereby essentially each selected template becomes 
available at a greatly increased copy number for cloning, sequencing, and/or further enrichment 

20 or diversification. 

[0094] In a binding assay, when the concentration of ligand is much less than that of the 
target (as it would be during the selection of a DNA-tempIated library), the fraction of ligand 
bound to target is determined by the effective concentration of the target protein. The fraction of 
ligand bound to target is a sigmoidal function of the concentration of target, with the midpoint 
25 (50% bound) at [target] ^ Ke, of the ligand-target complex. This relationship indicates that the 
stringency of a specific selection — the minimum ligand affinity required to remain bound to the 
target during the selection — is determined by the target concentration. Therefore, selection 
stringency is controllable by varying the effective concentration of target 

[0095] The target molecule (peptide, protein, DNA or other antigen) can be immobilized on 
30 a solid support, for example, a container wall, a wall of a microtiter plate well. The library 

preferably is dissolved in aqueous binding buffer in one pot and equilibrated in the presence of 
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immobilized target molecule. Non-binders are washed away with buffer. Those molecules that 
may be binding to the target molecule through their attached DNA templates rather than through 
their synthetic moieties can be eliminated by washing the bound library with unfiinctionalized 
templates lacking PGR primer binding sites. Remaining bound library members then can be 
eluted, for example, by denaturation. 

[0096] Alternatively, the target molecule can be immobilized on beads, particularly if there 
is doubt that the target molecule will adsorb sufficiently to a container wall, as may be the case 
for an unfolded target eluted from an SDS-PAGE gel. The derivatized beads can then be used to 
separate high-affinity library members from nonbinders by simply sedimenting the beads in a 
benchtop centrifuge. Alternatively, the beads can be used to make an affinity column. In such 
cases, the library is passed through the column one or more times to permit binding. The column 
then is washed to remove nonbinding library members. Magnetic beads are essentially a variant 
on the above; the target is attached to magnetic beads which are then used in the selection. 

[0097] There are many reactive matrices available for immobilizing the target molecule, 
including matrices bearing "-NH2 groups or -SH groups. The target molecule can be immobilized 
by conjugation with NHS ester or maleimide groups covalently linked to Sepharose beads and 
the integrity of known properties of the target molecule can be verified. Activated beads are 
available with attachment sites for -NH2 or -COOH groups (which can be used for coupling). 
Alternatively, the target molecule is blotted onto nitrocellulose or PVDF. When using a blotting 
strategy, the blot should be blocked (e.g., with BSA or similar protein) after immobilization of 
the target to prevent nonspecific binding of library members to the blot. 

[0098] Library members that bind a target molecule can be released by denaturation, acid, or 
chaotropic salts. Alternatively, elution conditions can be more specific to reduce background or 
to select for a desired specificity. Elution can be accomplished using proteolysis to cleave a 
linker between the target molecule and the immobilizing surface or between the reaction product 
and the template. Also, elution can be accomplished by competition with a known competitive 
ligand for the target molecule. Alternatively, a PGR reaction can be performed directly in the 
presence of the washed target molecules at the end of the selection procedure. Thus, the binding 
molecules need not be elutable from the target to be selectable since only the template is needed 
for further amplification or cloning, not the reaction product itself Indeed, some target 
molecules bind the most avid ligands so tightly that elution would be difficult. 
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[0099] To select for a molecule tiiat binds a protein expressible on a cell surface, such as an 
ion channel or a transmembrane receptor, the cells themselves can be used as the selection agent. 
The library preferably is first exposed to cells not expressing the target molecule on their 
surfaces to remove library members that bind specifically or non specifically to other cell surface 
epitopes. Alternatively, cells lacking the target molecule are present in large excess in the 
selection process and separable (by fluorescence-activated cell sorting (FACS), for example) 
from cells bearing the target molecule. In either method, cells bearing the target molecule then 
are used to isolate library members bearing the target molecule (e.g., by sedimenting the cells or 
by FACS sorting). For example, a recombinant DNA encoding the target molecule can be 
introduced into a cell line; library members that bind the transformed cells but not the 
untransformed cells are enriched for target molecule binders. This approach is also called 
subtraction selection and has successfully been used for phage display on antibody libraries 
(Hoogenboom etal. (1998) Immunotech 4: 1- 20). 

[0100] A selection procedure can also involve selection for binding to cell surface receptors 
that are internalized so that the receptor together with the selected binding molecule passes into 
the cytoplasm, nucleus, or other cellular compartment, such as the Golgi or lysosomes. 
Depending on the dissociation rate constant for specific selected binding molecules, these 
molecules may localize primarily within the intracellular compartments. Internalized library 
members can be distinguished from molecules attached to the cell surface by washing the cells, 
preferably with a denaturant. More preferably, standard subcellular fractionation techniques are 
used to isolate the selected library members in a desired subcellular compartment. 

[0101] An alternative selection protocol also includes a known, weak iigand affixed to each 
member of the library. The known iigand guides the selection by interacting with a defined part 
of the target molecule and focuses the selection on molecules that bind to the same region, 
providing a cooperative effect. This can be particularly useful for increasing the affinity of a 
Iigand with a desired biological function but with too low a potency. 

[0102] Other methods for selection or partitioning are also available for use with the present 
invention. These include, for example: immunoprecipitation (direct or indirect) where the target 
molecule is captured together with library members; mobility shift assays in agarose or 
polyacrylamide gels, where the selected library members migrate with the target molecule in a 
gel; cesium chloride gradient centrifugation to isolate the target molecule with library members; 
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mass spectroscopy to identify target molecules labeled with library members. In general, any 
method where the library member/ target molecule complex can be separated from library 
members not bound to the target is useful. 

[0103] The selection process is well suited for optimizations, where the selection steps are 
5 made in series, starting with the selection of binding molecules and ending with an optimized 
binding molecule. The procedures in each step can be automated using various robotic systems. 
Thus, the invention permits supplying a suitable librar}' and target molecule to a fully automatic 
system which finally generates an optimized binding molecule. Under ideal conditions, this 
process should run without any requirement for external work outside the robotic system during 
10 the entire procedure. 

[0104] The selection methods of the present invention can be combined with secondary 
selection or screening to identify reaction products capable of modifying target molecule 
function upon binding. Thus, the methods described herein can be employed to isolate or 
produce binding molecules that bind to and modify the function of any protein or nucleic acid. 

15 For example, nucleic acid-templated chemistry can be used to identify, isolate, or produce 
binding molecules (1) affecting catalytic activity of target enzymes by inhibiting catalysis or 
modifying substrate binding; (2) affecting the functionalify of protein receptors, by inhibiting 
binding to receptors or by modifying the specificity of binding to receptors; (3) affecting the 
formation of protein multimers by disrupting the quaternary structure of protein subunits; or (4) 

20 modifying transport properties of a protein by disrupting transport of small molecules or ions. 

[0105] Functional assays can be included in the selection process. For example, after 
selecting for binding activity, selected library members can be directly tested for a desired 
functional effect, such as an effect on cell signaling. This can, for example, be performed via 
FACS methodologies. 

25 [0106] The binding molecules of the invention can be selected for other properties in 
addition to binding. For example, to select for stability of binding interactions in a desired 
working environment. If stability in the presence of a certain protease is desired, that protease 
can be part of the buffer medium used during selection. Similarly, the selection can be 
performed in serum or cell extracts or in any type of medium, aqueous or organic. Conditions 

30 that disrupt or degrade the template should however be avoided to allow subsequent 
amplification. 
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(iy) Other Selections 

[0107] Selections for other desired properties, such as catal3^ic or other functional activities, 
can also be performed. Generally, the selection should be designed such that library members 
with the desired activity are isolatable on that basis from other library members. For example, 
5 libraiy members can be screened for the ability to fold or otherwise significantly change 

conformation in the presence of a target molecule, such as a metal ion, or under particular pH or 
salinity conditions. The folded library members can be isolated by performing non-denaturing 
gel electrophoresis under the conditions of interest. The folded library members migrate to a 
different position in the gel and can subsequently be extracted from the gel and isolated. 

10 [0108] Similarly, reaction products that fluoresce in the presence of specific ligands may be 
selected by FAGS based sorting of translated polymers linked through their DNA templates to 
beads. Those beads that fluoresce in the presence, but not in the absence, of the target ligand are 
isolated and characterized. Useful beads with a homogenous population of nucleic acid- 
templates on any bead can be prepared using the split-pool synthesis technique on the bead, such 

15 that each bead is exposed to only a single nucleotide sequence. Alternatively, a different anti- 
template (each complementary to only a single, different template) can by synthesized on beads 
using a split-pool technique, and then can anneal to capture a solution-phase library. 

[0109] Biotin-terminated biopolymers can be selected for the actual catalysis of bond- 
breaking reactions by passing these biopolymers over a resin linked through a substrate to avidin. 
20 Those biopolymers that catalyze substrate cleavage self-elute from a column charged with this 
resin. Similarly, biotin-terminated biopolymers can be selected for the catalysis of bond-forming 
reactions. One substrate is linked to resin and the second substrate is linked to avidin. 
Biopolymers that catalyze bond formation between the substrates are selected by their ability to 
react the substrates together, resulting in attachment of the biopolymer to the resin. 

25 [0110] Library members can also be selected for their catalytic effects on synthesis of a 
polymer to which the template is or becomes attached. For example, the library member may 
influence the selection of monomer units to be polymerized as well as how the polymerization 
reaction takes place (e.g., stereochemistiy, tacticity, activity). The synthesized polymers can be 
selected for specific properties, such as, molecular weight, density, hydrophobicity, tacticity, 

30 stereoselectivity, using standard techniques, such as, electrophoresis, gel filtration, centrifugal 



wo 2006/135654 



PCT/US2006/022207 



-38- 

sedimentation, or partitioning into solvents of different hydrophobicities. The attached template 
that directed the synthesis of the polymer can then be identified. 

[0111] Library members that catalyze virtually any reaction causing bond formation between 
two substrate molecules or resulting in bond breakage into two product molecules can be 
5 selected using the scheines proposed herein. To select for bond forming catalysts (for example, 
hetero Diels-Alder, Heck coupling, aldol reaction, or olefin metathesis catalysts), library 
members are covalently linked to one substrate through their 5' amino or thiol termini. The 
other substrate of the reaction is synthesized as a derivative linked to biotin. When dilute 
solutions of library-substrate conjugate are combined with the substrate-biotin conjugate, those 
10 library members that catalyze bond formation cause the biotin group to become covalently 
attached to themselves. Active bond forming catalysts can then be separated from inactive 
library members by capturing the former with immobilized streptavidin and washing away 
inactive library members 

[0112] In an analogous manner, library members that catalyze bond cleavage reactions such 
15 as retro-aldol reactions, amide hydrolysis, elimination reactions, or olefin dihydroxylation 
followed by periodate cleavage can be selected. In this case, library members are covalently 
linked to biotinylated substrates such that the bond breakage reaction causes the disconnection of 
the biotin moiety from the library members. Upon incubation under reaction conditions, active 
catalysts, but not inactive library members, induce the loss of their biotin groups. Streptavidin- 
20 linked beads can then be used to capture inactive polymers, while active catalysts are able to be 
eluted from the beads. Related bond formation and bond cleavage selections have been used 
successfiilly in catalytic RNA and DNA evolution (Jaschke et al (2000) CURR'. OPIN. Chem. 
Biol. 4: 257-62) Although these selections do not explicitly select for multiple turnover 
catalysis, RNAs and DNAs selected in this manner have in general proven to be multiple 
25 turnover catalysts when separated from their substrate moieties (Jaschke et al (2000) CURR. 
Opin. Chem. Biol. 4: 257-62; Jaeger et al (1999) Proc. Natl. Acad. Sci. USA 96: 14712-7; 
Bartel etal (1993) SCIENCE 261: 1411-8; S^netal (1998) CURR. Opin. Chem. BiOL. 2: 680-7). 

[0113] In addition to simply evolving active catalysts, the in vitro selections described above 
are used to evolve non-natural polymer libraries in powerful directions difficult to achieve using 
30 other catalyst discover}^ approaches. Substrate specificitj^ among catalysts can be selected by 
selecting for active catalysts in the presence of the desired substrate and then selecting for 
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inactive catalysts in the presence of one or more undesired substrates. If the desired and 
undesired substrates differ by their configuration at one or more stereocenters, enantioselective 
or diastereoselective catalysts can emerge from rounds of selection. Similarly, metal selectivity 
can be evolved by selecting for active catalysts in the presence of desired metals and selecting 
5 for inactive catalysts in the presence of undesired metals. Conversely, catalysts with broad 

substrate tolerance can be evolved by varying substrate structures between successive rounds of 
selection. 

[0114] Importantly, in vitro selections can also select for specificity in addition to binding 
affinity. Library screening methods for binding specificity typically require duplicating the 

10 entire screen for each target or non-target of interest. In contrast, selections for specificity can be 
performed in a single experiment by selecting for target binding as well as for the inability to 
bind one or more non-targets. Thus, the library can be pre-depleted by removing library 
members that bind to a non-target. Alternatively, or in addition, selection for binding to the 
target molecule can be performed in the presence of an excess of one or more non-targets. To 

15 maximize specificity, the non-target can be a homologous molecule. If the target molecule is a 
protein, appropriate non-target proteins include, for example, a generally promiscuous protein 
such as an albumin. If the binding assay is designed to target only a specific portion of a target 
molecule, the non-target can be a variation on the molecule in which that portion has been 
changed or removed. 

20 (vi) Amplification and Sequencing 

[0115] Once all rounds of selection are complete, the templates which are, or formerly were, 
associated with the selected reaction product preferably are amplified using any suitable 
technique to facilitate sequencing or other subsequent manipulation of the templates. Natural 
oligonucleotides can be amplified by any state of the art method. These methods include, for 

25 example, polymerase chain reaction (PGR); nucleic acid sequence-based amplification (see, for 
example, Compton (1991) Nature 350: 91-92), amplified anti-sense RNA (see, for example, 
van Gelder et al (1988) Proc. NATL. Acad. Sci. USA 85: 77652-77656); self-sustained 
sequence replication systems (Gnatelli et al (1990) Proc. NAtl. Acad. Sci. USA 87: 1874- 
1878); polymerase-independent amplification (see, for example, Schmidt et al (1997) NUCLEIC 

30 Acids Res. 25: 4797-4802, and in vivo amplification of plasmids carrying cloned DNA 

fragments. Descriptions of PGR methods are found, for example, in Saiki et al (1985) Science 
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230: 1350-1354; Scharf al (1986) Science 233: 1076-1078; and in U.S. Patent No. 4,683,202. 
Ligase-mediated amplification methods such as Ligase Chain Reaction (LCR) may also be used. 
In general, any means allowing faithful, efficient amplification of selected nucleic acid 
sequences can be employed in the method of the present invention. It is preferable, although not 
5 necessary, that the proportionate representations of the sequences after amplification reflect the 
relative proportions of sequences in the mixture before amplification. 

[0116] For non-natural nucleotides the choices of efficient amplification procedures are 
fewer. As non-natural nucleotides can be incorporated by certain enzymes including 
polymerases it will be possible to perform manual polymerase chain reaction by adding the 
10 polymerase during each extension cycle. 

[0117] For oligonucleotides containing nucleotide analogs, fewer methods for amplification 
exist. One may use non-enzyme mediated amplification schemes (Schmidt et al. (1997) NUCLEIC 
Acids Res. 25: 4797-4802). For backbone-modified oligonucleotides such as PNA and LNA, 
this amplification method may be used. Alternatively, standard PGR can be used to amplify a 
15 DNA fi-om a PNA or LNA oligonucleotide template. Before or during amplification the 

templates or complementing templates may be mutagenized or recombined in order to create an 
evolved library for the next round of selection or screening. 

(vii) Sequence Determination and Template Evolution 

[0118] Sequencing can be done by a standard dideoxy chain termination method, or by 
20 chemical sequencing, for example, using the Maxam-Gilbert sequencing procedure. 

Alternatively, the sequence of the template (or, if a long template is used, the variable portion(s) 
thereof) can be determined by hybridization to a chip. For example, a single-stranded template 
molecule associated with a detectable moiety such as a fluorescent moiety is exposed to a chip 
bearing a large number of clonal populations of single-stranded nucleic acids or nucleic acid 
25 analogs of known sequence, each clonal population being present at a particular addressable 

location on the chip. The template sequences are permitted to anneal to the chip sequences. The 
position of the detectable moieties on the chip then is determined. Based upon the location of 
the detectable moiety and the immobilized sequence at that location, the sequence of the 
template can be determined. It is contemplated that large numbers of such oligonucleotides can 
30 be immobilized in an array on a chip or other solid support. 
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[0119] Libraries can be evolved by introducing mutations at the DNA level, for example, 
using error-prone PGR (Cadwell et al (1992) PGR METHODS APPLIC. 2: 28) or by subjecting the 
DNA to in vitro homologous recombination (Stemmer (1994) Proc. Natl. Acad. Sci. USA 91: 
10747; Stemmer (1994) NATURE 370: 389) or by cassette mutagenesis. 

5 (a) Error-prone PCR 

[0120] Random point mutagenesis is performed by conducting the PGR amplification step 
under error-prone PGR (Gadwell er al (1992) PGR Methods Applic. 2: 28-33) conditions. 
Because the genetic code of these molecules are written to assign related codons to related 
chemical groups, similar to the way that the natural protein genetic code is constructed, random 
10 point mutations in the templates encoding selected molecules will diversify progeny towards 
chemically related analogs. Because error-prone PGR is inherently less efficient than normal 
PGR, error-prone PGR diversification is preferably conducted with only natural dATP, dTTP, 
dGTP, and dGTP and using primers that lack chemical handles or biotin groups. 

(b) Recombination 

15 [0121] Libraries may be diversified using recombination. For example, templates to be 
recombined may have a structure in which codons are separated by five-base non-palindromic 
restriction endonuclease cleavage sites such as those cleaved by Avail (G/GWCC, W=A or T), 
Sau96l (G/GNCC, N=A, G, T, or C), Ddel (C/TNAG), or HinVl (G/ANTC). Following 
selections, templates encoding desired molecules are enzymatically digested with these 

20 commercially available restriction enzymes. The digested fragments then are recombined into 
intact templates with T4 DNA ligase. Because the restriction sites separating codons are 
nonpalindromic, template fragments can only reassemble to form intact recombined templates 
(Figure 14). DNA-templated translation of recombined templates provides recombined small 
molecules. In this way, functional groups between synthetic small molecules with desired 

25 activities are recombined in a manner analogous to the recombination of amino acid residues 
between proteins in Nature. It is well appreciated that recombination explores the sequence 
space of a molecule much more efficiently than point mutagenesis alone (Minshull et al (1999) 
CuRR. Opin. Chem. Biol. 3: 284-90; Bogarad et al (1999) Proc. Natl. Acad. Sci. USA 96: 
2591--5; Stemmer NATURE 370: 389-391). 



wo 2006/135654 



PCT/US2006/022207 



-42- 

[0122] A preferred method of diversifying library members is through non homologous 
random recombination, as described, for example, in WO 02/074978; US Patent Application 
Publication No. 2003-00271 80-Al; and BiUker et al. (2002) Nature Biotech. 20(10): 1024-9. 

(c) Random Cassette Mutagenesis 

5 [0123] Random cassette mutagenesis is useful to create a diversified library from a fixed 
starting sequence. Thus, such a method can be used, for example, after a library has been 
subjected to selection and one or more library members have been isolated and sequenced. 
Generally, a library of oligonucleotides with variations on the starting sequence is generated by 
traditional chemical synthesis, error-prone PGR, or other methods. For example, a library of 
10 oligonucleotides can be generated in which, for each nucleotide position in a codon, the 

nucleotide has a 90% probability of being identical to the starting sequence at that position, and a 
10% probability of being different. The oligonucleotides can be complete templates when 
synthesized, or can be fragments that are subsequently ligated with other oligonucleotides to 
form a diverse library of templates. 

1 5 [0124] Information about template design, codon usage, transfer unit design, coupling 
chemistries, reaction conditions, and selection and screening protocols can be found, for 
example, in U.S. Patent Publication Nos. US-2003/01 13738 and US-2004/0 180412. 

V, USES 

[0125] The methods and compositions of the present invention represent new ways to 
20 generate polymers with desired properties. This approach marries extremely powerful genetic 
methods, which molecular biologists have taken advantage of for decades, with the flexibility 
and power of organic chemistry. The ability to prepare, amplify, and evolve unnatural polymers 
by genetic selection may lead to new classes of catalysts that possess activity, bioavailability, 
stability, fluorescence, photolability, or other properties that are difficult or impossible to achieve 
25 using the limited set of building blocks found in proteins and nucleic acids. 

[0126] For example, unnatural biopolymers useful as artificial receptors to selectively bind 
molecules or as catalysts for chemical reactions can be isolated. Characterization of these 
molecules would provide important insight into the ability of polycarbamates, polyureas, 
polyesters, polycarbonates, polypeptides with unnatural side chain and stereochemistries, or 



wo 2006/135654 



PCT/US2006/022207 



-43- 

other unnatural polymers to form secondary or tertiary structures with binding or catalytic 
properties. 

Examples 

Example 1: Polymer Evolution by Templated Synthesis 

5 [0127] A proposed scheme for synthetic polymer evolution using DNA-templated organic 
synthesis is shown in Figure 2. Peptide nucleic acid (PNA) is an attractive candidate for this 
strategy because PNA monomers are readily synthesized in the laboratory and because their 
ability to associate sequence-specifically with nucleic acids enables PNA coupling to be 
controlled by nucleic acid-templated synthesis (Nielsen, P. E. (1997) BlOPHYS. Chem. 68, 103- 

10 8; Schmidt et al (1997) NUCLEIC ACIDS RES, 25, 4797-802; Schmidt et al (1997) NUCLEIC 
ACIDS Res. 25, 4792-6; Bohler et al (1995) NATURE 376, 578-81). Previous studies have 
established the ability of DNA-templated reductive amination reactions (Li, X et al (2002) J. 
Am. Chem. Soc. 124, 746-7; Li, 'X.etal (2002) Angew Chem. Int. Ed. Engl. 41, 4567-9; 
Rosenbaum & Liu (2003) J. Am. Chem. Soc. 125, 13924-5; Gothelf a/. (2004) J. Am. Chem. 

15 Soc. 126, 1044-6) to mediate the polymerization of PNA aldehyde oligomers on DNA hairpin 
templates (Rosenbaum et al (2003) J. Am. Chem. Soc. 125, 13924-5). Although these 
reactions exhibited promising degrees of efficiency and sequence-specificity when applied to 
single DNA template sequences containing predominantly AGTC codons (Li et al (2002) J. Am. 
Chem. Soc. 124, 746-7; Li, X. & Lynn, D. G. (2002) ANGEW CHEM. INT. ED. ENGL. 41, 4567- 

20 9), they have not previously been applied to the more complex problems of translating highly 
varied templates or translating libraries containing many different templates simultaneously. 

[0128] Using DNA-templated poljraerization and a j&ameshift-resistant "genetic code", a 
library of 3.5 x 10^ DNA sequences was translated into a corresponding library of peptide 
nucleic acid 40-mers, each of which was covalently linked to its DNA template. In vitro 

25 selection for binding to a target protein followed by PGR amplification of surviving templates 
led to the identification of a PNA with confirmed protein binding activity and specificity. The 
DNA encoding this PNA was partially randomized, re-translated, and re-selected to yield a 
second-generation synthetic polymer with improved target protein affinity. The results 
demonstrate that non-enzymatic, template-directed synthesis can support the evolution-driven 

30 discovery of functional synthetic polymers that would be difficult to isolate using conventional 
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synthesis and screening methods. In addition, these results establish the ability of PNAs to adopt 
conformations capable of binding specifically to target proteins. 

1. MATERIALS AND METHODS 

(a) DNA template preparation. 

5 [0129] DNA templates were synthesized on a PerSeptive Biosystems Expedite 8090 DNA 
synthesizer using standard phosphoramidite protocols. All DNA synthesis reagents were 
purchased from Glen Research. DNA templates for polymerization experiments were 
synthesized with a 5'-MMT-amino-dT phosphoramidite at the 5' terminus (abbreviated H2NT in 
the sequences below). Purification of DNA templates was carried out by (/) deprotection with 

10 1:1 ammonium hydroxide:methylamine for 60 min at 55°C; (if) reverse-phase HPLC purification 
of the MMT-containing fraction using a [8% acetonitrile in 0.1 M TEAA pH 7] to [40% 
acetonitrile in 0.1 M TEAA pH 7] solvent gradient with a column temperature of 45°C; (iii) 
MMT cleavage with 80% acetic acid for 60 min; (iv) reverse-phase HPLC purification of the 
MMT-deprotected product. In addition, templates for library translation were subjected to 

15 PAGE purification on a 10% denaturing polyacrylamide geL 

Template sequences are as follows: 
(ATTC) i n coding region 

H2NTGCGACGGTATACCGTCGCAATTCATTCATTCATTCATTCATTCATTCATTCATTC 
ATTC (SEQ ID NO: 1) 

20 (AGTO i n coding region 

H2NTGCGACGGTATACCGTCGCAAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAG 
TCAGTC (SEQ ID NO: 2) 

(ACTO u j coding region 

H2NTGCGACGGTATACCGTCGCAACTCACTCACTCACTCACTCACTCACTCACTCACT 
25 CACTC (SEQ ID NO: 3) 

(ACTCATGC)^ coding region 

H2NTGCGACGGTATACCGTCGCAACTCATGCACTCATGCACTCATGCACTCATGCAC 
TCATGC (SEQ ID NO: 4) 
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(ACTCAGGO s coding region 

H2NTGCGACGGTATACCGTCGCAACTCAGGCACTCAGGCACTCAGGCACTCAGGCAC 
TCAGGC (SEQ ID NO: 5) 

CACGOin coding region 

5 H2NTGCGACGGTATACCGTCGCAACGCACGCACGCACGCACGCACGCACGCACGCA 
CGCACGC (SEQ ID NO: 6) 

(ATCQ h ) coding region 

H2NTGCGACGGTATACCGTCGCAATCCATCCATCCATCCATCCATCCATCCATCCATC 
CATCC (SEQ ID NO: 7) 

10 f AGCO i n coding region 

H2NTGCGACGGTATACCGTCGCAAGCCAGCCAGCCAGCCAGCCAGCCAGCCAGCCA 
GCCAGCC (SEQ ID NO: 8) 

CACCQjn coding region 

H2NTGCGACGGTATACCGTCGCAACCCACCCACCCACCCACCCACCCACCCACCCAC 
15 CCACCC (SEQ ID NO: 9) 

First-generation library 

H2NTGCGACGGTGCGCACCGTCGCAABBCABBCABBCABBCABBCABBCABBCABBC 
ABBCABBCGGACAAGGTGCGCACCTTGTCC (SEQ ID NO: 10) 

Second-generation library 

20 H2NTGCGACGGTGCGCACCGTCGCAAGCCATTCATGCATBCABBCABBCATGCATTC 
ATTCATGCGGACAAGGTGCGCACCTTGTCC (SEQ ID NO: 11) 

(b) PNA building block preparation. 

[0130] PNA aldehyde building bloclcs were synthesized as reported preyiously (Li (2002) 
supra). Masses of the synthesized building blocks were verified by ESI mass spectrometry, and 
25 expected and observed masses of the building blocks are set forth in Table2: 
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T ABLE 2 



Sequence 


Expected mass 


Observed iriass. 


gaat 


1109 


1109.4 


gagt 


1125 


1125.4 


ggat 


1125 


1125.4 


gggt 


1141 


1141.4 


ggct 


1101 


1101.4 


gcat 


1085 


1085.4 


gcgt 


1101 


1101.4 


gact 


1085 


1085.4 


gcct 


1061 


1061.4 



(c ) DNA-templated polymerization. 

[0131] For the reactions presented in Figure 3A, 20 pmol template DNA (containing 200 
5 pmol total of four-base codons) was mixed with 800 pmol (4.0 equiv) PNA peptide aldehyde in 
50 luL of 100 mM TAPS pH 8.5 buffer containmg 1 M NaCl The gcat and gcct building blocks 
were tested in hetero-polymerization reactions due to the tendency of their homo-polymerization 
templates to adopt internal secondary structure (gcat) or exhibit unusually low PNA-DNA 
melting temperature (Giesen, U. et al (1998) Nucleic Acids Res. 26, 5004-5006) (gcct). 

10 Reactions were heated to 95 °C for 10 min and cooled to 25 °C over 1 hour. NaCNBHs was 
added to 80 mM. Reactions were allowed to proceed 1 hour at 25 °C, then subjected to gel 
filtration twice (Princeton Separation). Products were analyzed by 10% denaturing PAGE. The 
same conditions were used for library polymerizations presented in Figure 3B, except that 20 
pmol library DNA template was used together with an equimolar mixture of nine gvvt PNA 

15 aldehyde building blocks (total peptide = 800 pmol, 1600 pmol, or 3200 pmol as specified 
below). 

(d) Displacement of the PNA strand. 
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[0132] For the library material used in the selections presented in Figures 4 and 6, library 
polymerization was carried out using the protocol described above but with 3200 pmol (16.0 
equiv) PNA gvvt per 50 ]xL reaction. After gel filtration, one-half of the translation reaction 
('-10 pmol of product) was resuspended in 25 |liL Thermopol buffer (New England Biolabs), 0.25 
5 |LiL 25 mM dNTPs, and 0.5 fxL (1 U) Therminator DNA polymerase (NEB). The displacement 
mixture was heated to 95 °C for 5 min, cooled to 55 °C for 45 s, and incubated at 72 °C for 30 
min. After cooling to 25 the reaction was passed through a gel filtration column to remove 
buffer and dNTPs. 

[0133] The PNA displacement of an individual translated double-hairpin template was 
10 typically analyzed as follows. The sequence of the template used in this experiment is shown 
below and contains a unique Sph I cleavage site (GCATGC, underlined) at nucleotides 35-40: 

H9NTCGAATTCGTACGAATTCGAAGTCACTCATCCA TGCATGCA CTCATCCAGTCTTT 
TGTGCGGACGATCGTCCGCAC (SEQ ID NO: 12) 

[0134] The restriction endonuclease Sph I exclusively cleaves double-stranded UNA 
1 5 containing GCATGC. Therefore Sph I cleavage indicates the creation of double-stranded DNA 
in the template. We assume that double-stranded DNA is mutually exclusive with a PNA-DNA 
paired complex, and therefore Sph I cleavage also implies displacement of the PNA strand. The 
individual template was translated using the PNA aldehyde building blocks gact, gagt, ggat, and 
gcat (20 pmol template +160 pmol each building block per 50 |liL reaction). The translated 
20 template was displaced as described above. 

[0135] As an Sph I cleavage positive control, the untranslated double-hairpin template was 
also "filled in'' using DNA polymerase. To create the positive control, 20 pmol of untranslated 
template was combined with 250 |liM dNTPs and 1 unit Klenow (exo-) DNA polymerase in 25 
|LiL total Ecopol buffer. The reaction was incubated at 37 °C for 30 min, heated at 75 °C for 20 

25 min (to heat-inactivate the DNA polymerase), and subjected to gel filtration. An aliquot (20 

pmol) of each sample (template alone, "filled in" control template, translated product, displaced 
product) was digested in 25 |aL reactions containing NEB2 buffer (New England Biolabs) plus 5 
U Sph I for 90 min at 37 °C. Following digestion, samples were subjected to gel filtration, 
centrifuged under vacuum to dryness, resuspended in 50% formamide in IX TBE, heated to 95 

30 °C for 15 minutes, and analyzed by electrophoresis on a 10% denaturing (TBE-urea) PAGE gel 
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followed by staining with ethidium bromide. The resulting gel is shown in Figure 7, in which 
lane 1 = template plus Sph I; lane 2 = "filled in" template plus Sph I; lane 3 = translation product 
plus Sph I; lane 4 = translated and displacement product plus Sph L The presence of the fast- 
running band in both lane 2 and lane 4 represents cleaved double-stranded DNA in both the 
5 positive control (lane 2) and in the translated and displaced product (lane 4). 

(e) Protein affinity selections. 

[0136] The methods used for in vitro papain affinity selection have been reported previously 
(Giesen (1998) Nucleic ACIDS Res. 26, 5004-5006). Papain-conjugated sepharose beads (50 
|LiL) were prepared according to this protocol and combined with 10 pmol of translated and 
10 displaced PNA library. After 4 hours at 4 °C with slow mixing, the beads were filtered, washed 
three times with high salt buffer (50 mM Tris pH 7.5, 0.5 M NaCl), once with low salt buffer (50 
mM Tris pH 7.5, 0.1 MNaCl), and resuspended in 50 \xL papain selection buffer (50 mM Tris 
pH 7.5, 100 mM NaCl, 1 mM EDTA). One fifth of the beads were removed for PGR as 
described below. 

15 [0137] The remaining beads (40 |liL) were heated for 15 min at 95 °C, cooled to 60 °C, 

supplemented with an equal volume of fresh papain-linked sepharose beads; cooled rapidly to 4 
and the selection was repeated. This protocol was repeated three times (four rounds total) for 
the first-generation selection (Figure 4A) and was repeated twice (three rounds total) for the 
second-generation selection (Figure 6A). After each round of affinity selection, PGR was 

20 performed using one fifth of the total beads. Filtered beads were resuspended in 18 j^L NEB 

buffer 4 with BSA, 1 \xL Hha I, and 1 |liL HinPl I (all restriction endonucleases purchased from 
New England Biolabs). Reactions were incubated for 1.5 hours at 37 °G, then added directly into 
80 |LiL PGR master mix containing 5 jaL Taq buffer, 5 ]xL 25 mM MgGla, 1 |liL 100 |liM primer 1, 
1 laL 100 laM primer 2, 1 \xL 25 mM dNTPs, and 1 |aL Taq DNA polymerase and 54 \xL H2O. 

25 Primer sequences are as follows: 

Primer 1 

GGGCCGGGATGCGCAGGGTCGGA (SEQ ID NO: 13) 
Primer 2 

CCGCCGCTCGAGGCACCTTGTCC (SEQ ID NO: 14) 
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[0138] The PGR protocol was as follows: 25 cycles of 30 seconds at 94 °C, 30 seconds at 55 
°C, and 30 seconds at 72 °C. Selection PGR reactions were always performed side-by-side with 
a control reaction in which 20 fxL water was used in the place of selection beads. In Figure 4B, 
10 )LiL of each PGR reaction was analyzed by 2.5% agarose gel electrophoresis. 

5 (f) Cloning and analysis of sequences surviving selection. 

[0139] The 86-base pair PGR product resulting from the final round of selection (fourth 
round for first-generation library; third round from second-generation library) was purified by 
agarose gel electrophoresis, digested with BamH I andXho I restriction endonucleases, and 
ligated into pBluescript II (Stratagene) digested with BamH I and Xho I. The ligation was 
10 transformed into 40 |liL electrocompetent DHIOB cells (Invitrogen) by electroporation. 

Individual colonies were cultured and their plasmids were sequenced using standard automated 
fluorescence-based DNA sequencing methods with the following primer: 

GACAGAGGAAAGAGGTATGAGGATG (SEQ ID NO: 15) 

Alignments of the resulting sequences were performed using the ClustalW algorithm (Zuker, M. 
15 Mfold web server for nucleic acid folding and hybridization prediction. NUCLEIC ACIDS RES. 
31,3406-15 (2003)). 

(g) Enrichment assays. 

[0140] For the enrichment experiments described in Figure 5A, 10 pmol total template 
mixture (500:1 Ml template:Pl template; or 500:1 Ul template:Pl template) was translated as 
20 described above using library translation conditions. The individual template sequences are as 
follows: 

Pl template 

H2NTGGGAGGGTGCGGAGCGTCGGAAGCGATTGATGGATGCACGCAGGCATGGATTG 
ATTGATGGGGAGAAGGTGGGGAGCTTGTGG (SEQ ID NO: 16) 

25 Ml template 

H2NTGCGACGGTGCGCACCGTCGCAAGCCATTCATGCACCCATGGAGGCATGCATTC 
ATTCATGCGGACAAGGTGCGCACCTTGTCC (SEQ ID NO: 17) 

Ul template 
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H2NTGCGACGGTGCGCACCGTCGCAAGTCAGTCACCCAATCACTCACGCATCCAGTC 
ACCCAACCGGACAAGGTGCGCACCTTGTCC (SEQ ID NO: 18) 

[0141] The translated products were displaced as described above, subjected to three rounds 
of selection against papain as described above, and the resulting bound DNA templates were 
5 digested and amplified by PGR as described above. The PGR product resulting from the Ml/Pl 
experiment was digested with the restriction endonuclease BsaJl for 4 hours at 60 °G, and 
compared to digests of PGR products from Ml template or PI template alone. The PGR product 
resulting from the Ul/Pl enrichment experiment was digested with Fok I for 4 hours at 37 °C 
and compared to digests of PGR products from Ul or PI template alone. 

10 (h) Solid-phase PNA synthesis. 

[0142] The solid-phase syntheses of selected PNA sequences (truncated PI, truncated P2, 
and truncated Ml) were executed according to established protocols (Reader et ah (2002) 
Nature 420, 841-4). Briefly, 50 mg FMOC-PAL-PEG-PS resin (Applied Biosystems) with a 
loading level of 0. 17 mmol/g was swelled in DMF, and the FMOC group was removed with two 

15 washes of 5 mL 20% piperidine. All manipulations were performed at 25 °C in a peptide 
synthesis vessel with N2 gas bubbled through the reaction mixtures. Each cycle of peptide 
synthesis consisted of: (/) coupling with an activated FMOC-protected monomer for 15 min, 
followed by washing with DMF; (//) capping with 5% acetic anhydride in DMF for 5 min, 
followed by washing with DMF; and (///) deprotection of the FMOG group with two washes of 5 

20 mL 20% piperidine for 5 min, followed by washing with DMF. Monomers were activated by 
combining 50 |Limol FMOG-PNA monomer (26-36 mg, depending on the monomer), 50 jixmol 
HATU (19 mg), 100 jumol DIPEA (17.4 |aL), and 50 jumol 2,6-lutidine (5.8 jiiL) in 1.2 mL DMF, 
and incubating the resulting reaction 7 min at 25 °G. 

[0143] After all peptide synthesis cycles were complete, the PNA was cleaved from solid 
25 support by resuspending the resin in a solution consisting of TFA + 10% /w-cresol + 3% H2O and 
bubbling with N2 for 1 hour. The solution was filtered and PNAs were precipitated from the 
filtrate by adding 50 mL ice-cold t-butyl methyl ether (TBME) to the cleavage cocktail, after 
which a white precipitate immediately formed. The precipitate was isolated by centrifugation, 
and the pellet was washed several times with additional TBME. After lyophilization, the crude 
30 cleavage mixture was subjected to reverse-phase HPLC with a gradient from 0% to 35% 
acetonitrile in 0.1% aqueous TFA over 30 min (flow rate == 4 mL/min). Peaks (monitoring 
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absorbance at 260 nm) corresponding to PNA peptides were collected from 16 min to 22 min 
elution times, and each eluted peak was analyzed by MALDI mass spectrometry (sinapinic acid 
matrix) on an Applied Biosystems Voyager mass spectrometer. The peak corresponding to 
intact 18-base PNA peptide was consistently the last peak to elute from the column with 
5 absorption at 260 nm. The masses of the 18-base PNAs are as follows: 

PI: Expected = 4929.5; Observed = 4929.7 

Ml: Expected = 4929.5; Observed = 4929.0 

P2: Expected = 4913.5; Observed = 4913.4 

(i) Papain binding assays. 

10 [0144] Purified PNA 18-mers prepared as described above were reacted with BODIPY- 

fluorescein succinamidyl ester (BFL-SE, Molecular Probes). For each labelling reaction, 25 |liL 
of a 250-400 |liM stock solution of PNA in H2O was mixed with 25 jliL 0.2 M sodium 
bicarbonate (pH 8.0) plus 2.5 |liL of a 20 mg/mL solution of BFL-SE in DMSO. After four hours 
at 25 °C, 50 ixL DMSO was added, and the labelling reaction was incubated for an additional 4 

15 hours. The labelled product was purified by reverse-phase HPLC as described above. The 

addition of the BFL fluorophore on the N-terminal primary amine typically delayed the elution 
time of the PNA by 5 min, and the absorption of the eluted peak was monitored both at 260 nm 
and at 470 nm. Masses of labelled peptides were confirmed by MALDI mass spectrometry, as 
follows: 

20 PI labelled: Expected = 5317.7; Observed = 5317.1 
Ml labelled: Expected = 5317.7; Observed = 5316.9 
P2 labelled: Expected = 5301.7; Observed = 5302.2 

[0145] Fluorescence polarization-based papain afBnity assays were carried out as follows. 
BFL-labelled PNA was used at concentrations between 5 nM and 10 nM. This concentration 
25 range routinely provided signahnoise ratios exceeding 20: 1 . For the BFL-amine controls in 

Figures 5B and 6C, BFL-ethylenediamine (Molecular Probes) was used at a concentration of 10 
nM. For each labelled PNA or control, a series of binding reactions were established containing 
papain selection buffer (see above), the labelled PNA or control, and one of several 
concentrations of papain. Papain (Sigma-Aldrich) was dissolved in papain activation buffer (5.5 
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mM cysteine HCl, 1.1 mM EDTA, and 0,067 mM p-mercaptoethanol) at 5 mg/mL. The protein 
was precisely diluted (in activation buffer) to a concentration of 100 |aM. This stock solution 
was then used for all binding assays. After the binding reactions were equilibrated (4 hours at 25 
°C), they were loaded onto a Nunc opaque black 384-well plate, and fluorescence polarization 
5 was measured using the Analyst AD system (Molecular Devices) with a fluorescein filter. 
Fluorescence polarization data in Figures 5B and 6C is shown as the change in fluorescence 
polarization (calculated relative to the output in the absence of protein) as a function of papain 
concentration. For the protein-binding specificity experiment shown in Figure 5C, the same 
protocol was used, except that papain was replaced with trypsin or lysozyme, and protein was 
10 simply dissolved in papain selection buffer (rather than in activation buffer). 

[0146] The papain binding assays of the truncated P2 PNA in the presence of DNA 
complementary to P2 were carried out as above, except that before the assays took place, a 500 
nM solution of BFL-labelled truncated P2 PNA was combined with 5 |liM (10 equiv) of "anti- 
P2" DNA or with "anti-control" DNA in papain selection buffer: 

15 Anti-P2: GCATGCAGCCAGTCATGC (SEQ ID NO: 19) (complementary to P2) 

Anti-control: GCAGTCATGCATGCAGCC (SEQ ID NO: 20) (scrambled codon control) 

[0147] The solution was heated to 95 °C for 2 min, slowly cooled to 25 ""C over 30 min, then 
incubated at 25 ""C for 1 hour. The resulting solution was diluted 50-fold into each binding 
reaction (10 nM P2 per assay), and assayed as described above. 

20 2. RESULTS 

[0148] To generate libraries of PNA polymers using DNA-templated PNA aldehyde 
coupling, several codon sets were tested that were predicted to have well-matched PNA:DNA 
base-pairing stabilities (Giesen et al (1998) Nucleic Acids Res. 26, 5004-5006), and that 
minimize frameshifting by enforcing at least one mismatch when codons are misaligned on 
25 templates. Each of the nine PNA aldehyde building blocks of the sequence gvvt (where lower 
case letters are used to represent PNAs, and v = a, c, or g) undergo highly efficient, sequence- 
specific polymerization reactions with a variety of 40-base complementary DNA templates 
(ABBC)io (where B = T, G, or C) in the presence of NaBHaCN to generate predominantly full- 
length synthetic polymer products (Figure 3A). As shown in Figure 3A, these polymerization 
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reactions proceed efficiently even when templates contain five or ten copies of any one of the 
nine possible ABBC codons. 

[0149] When an equimolar mixture of the nine gvvt PNA aldehyde building blocks was 
combined with a library of up to 3.5 x 10^ different DNA templates consisting of ten consecutive 
5 randomized ABBC codons flanked on either side by 22-base hairpins, an efficient conversion to 
species whose denaturing PAGE mobility consistent with a library of PNA covalently linked to 
their DNA templates (Figure 3B) was observed. 

[0150] To allow the PNA component of the library to fold prior to selection, the PNA strand 
of each library member was freed fi*om base pairing with its DNA template. To accomplish this 

10 goal, the 3' hairpin of each template was extended using a strand displacement-competent 

thermophilic DNA polymerase and dNTPs for 30 min at 72 °C. Because this approach generates 
double-stranded DNA templates, it also prevented the DNA component of the resulting library 
from folding into conformations that may survive selection (Joyce, G. F. (2004) Annu. Rev. 
Biochem. 73, 791-836) independent of the PNA. Szostak and co-workers recently showed that a 

15 similar strategy successfully displaced the threose nucleic acid (TNA) strand of a TNA:DNA 
duplex (Ichida, J. K. et al (2005) J. AM. Chem. Soc. 127, 2802-3). For both single-sequence 
PNA-DNA hairpin conjugates (data not shown) as well as for PNA-DNA libraries (Figure 4A), 
a significant shift in PAGE mobility upon treatment with DNA polymerase and dNTPs was 
observed. Experiments using restriction endonucleases specific for double-stranded DNA 

20 indicate that products firom these displacement reactions predominantly exist in a form in which 
the translated PNA strand is no longer base-paired with the DNA template (see Materials and 
Methods), consistent with successfiil PNA strand displacement. 

[0151] The library of translated and displaced PNAs covalently linked to their corresponding 
DNA templates were subjected to in vitro affinity selections (Doyon et al (2003) J. AM. CHEM. 

25 Soc. 125, 12372-3) for binding to papain, a commercially available cysteine protease. The 

(gvvt) 10 library arising from 10 pmol of starting DNA template was incubated with papain-linked 
beads at 4 °C for 4 hours. The beads were extensively washed to remove non-binders. The 
washed beads were heated to elute bound library members, then combined (without filtration) 
with a fresh aliquot of papain-linked beads and incubated and washed as before. In contrast with 

30 a traditional affinity selection protocol in which eluted binders are transferred to new vessels 
between rounds, it was found that this protocol dramatically reduced material losses between 
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rounds without compromising the effectiveness of each round of selection (see Materials and 
Methods), 

[0152] After four rounds of affinity selection as described above, the templates surviving 
selection were amplified by PGR (Figure 4B), cloned into pBluescript, and sequenced to reveal 
5 the identity of the DNA templates (and, by inference, the identity of the PNA polymers) 

surviving the papain binding selection. Three of the nine sequenced clones were identical or 
differed only at one base, suggesting that the PNA corresponding to this sequence (designated 
PI) may possess papain-binding activity (Figure 4C). Although secondary structure prediction 
algorithms for PNA have not been reported, when analyzed by the mFold RNA secondary 
10 structure prediction method (Zuker (2003) Nucleic Acids Res. 31, 3406-15 ) using a high 

simulated salt concentration to minimize error arising fi-om the mostly uncharged state of the 
PNA backbone, the PI sequence was predicted to form a strong stem-loop structure consisting of 
an eight base-pair stem and a six-base loop (Figure 4D). 

[0153] As an initial characterization of the PI translation product, a series of enrichment 
15 experiments were carried out that were designed to compare the survival of PI during the papain 
affinity selection with that of closely related or unrelated PNA-DNA conjugates. The DNA 
template encoding PI was combined with a 500-fold excess of a DNA template encoding a 
mutant of PI, designated Ml. The locations of a gggt codon predicted to lie in Pi's loop and a 
gcat codon predicted to lie in Pi's stem were swapped in Ml, which was otherwise identical to 
20 PI (Figure 5A). The 500:1 M1:P1 template mixture was translated, displaced, and subjected to 
three rounds of papain affinity selection as described above. The DNA templates fi-om species 
surviving the third selection round were amplified by PGR and analyzed by restriction digestion. 
The Pl-encoding template was enriched ~500-fold afl;er selection relative to the Ml-encoding 
template (Figure 5 A), demonstrating that the order of codons with PI determines its ability to 
25 survive the selection. 

[0154] To confirm that the above enrichment of PI does not arise from an unusually poor 
affinity of Ml for papain, the experiment was repeated with Ml and an unrelated library 
sequence (designated Ul). Translation, displacement, and selection beginning with a 500:1 ratio 
of M1:U1 DNA templates resulted in no detectable enrichment for Ul (Figure 5A). Taken 
30 together, these results indicate that the PI translation product survives papain affinity selection 
much more efficiently than either a codon-swapped mutant (Ml) or an unrelated sequence (Ul). 
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[0155] To characterize the ability of the selected synthetic polymer to bind papain in the 
absence of its DNA template, a portion of the PI PNA was synthesized by solid-phase synthesis. 
Simplifying assumptions were made which included that (/) the predicted stem-loop region of PI 
was responsible for its putative papain affinity, and (ii) the secondary amine linkages between 
5 every fourth PNA nucleotide arising from DNA-templated reductive amination could be replaced 
by standard amide linkages without abolishing papain binding activity. The 18-base PNA 
corresponding to the majority of the PI stem-loop (Figure 5B, expected mass = 4929.5 D; 
observed mass = 4929.7 D) was conjugated to the fluorophore BFL at its amino terminus to 
enable papain binding assays using fluorescence polarization. As controls, the 18-base PNA 
10 corresponding to the Ml mutant of PI was also synthesized on solid-phase and conjugated to 
BFL (expected mass = 4929.5 D; observed mass = 4929.0 D), as well as the 18-base DNA 
analogue of PI containing a 5'-amino group (Figure 5B). 

[0156] Fluorescence polarization assays in the presence of varying concentrations of papain 
revealed that the PI stem-loop PNA possesses significant papain affinity. Although binding was 

15 not fully saturated at the maximum papain concentration that could be tested (70 |j,M), the of 
the Pl-papain complex was estimated to be approximately 25 \xM (Figure 5B). In contrast, 
neither Ml nor the DNA analogue of the PI stem loop exhibited any detectable papain affinity. 
The BFL fluorophore alone also did not bind papain (Figure 5B). These results indicate that the 
PI synthetic polymer discovered from the translation and selection process described above 

20 possesses papain-binding activity. In addition, this activity is dependent on the sequence of 

monomers and on the structure of the polymer's backbone, but does not require the presence of 
the DNA template. 

[0157] In order to determine if PI bound proteins in a non-specific manner, or if it exhibited 
target-binding specificity, the protein-binding assays were repeated with two proteins unrelated 
25 to papain: trypsin and lysozyme. Truncated PI PNA did not exhibit detectable affinity for either 
trypsin or lysozyme (Figure 5C). These results are consistent with the hypothesis that PI does 
not simply bind non-specifically to common features of proteins (such as the presence of 
partially exposed hydrophobic groups), but instead adopts a three-dimensional conformation that 
is at least partially selective for binding to papain. 

30 [0158] In an effort to evolve a synthetic polymer with improved functional properties, a 

second-generation template library based on PI was created in which three of the codons at the 
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end of the predicted stem-loop were randomized at a total of five nucleotide positions (Figure 
6A). The corresponding DNA template library (theoretical size of 243 members) was translated, 
displaced, and subjected to three iterated rounds of papain affinity selection as described above. 
A majority (8 out of 13, or 62%) of the clones recovered jfrom this second-generation selection 
5 are of a single sequence, designated P2 (Figure 6A). Although three of the five randomized 
positions in P2 converge on the parental PI sequence, P2 contains two new mutations: a [c-^a] 
mutation in the stem and a [g— >c] mutation in the loop. The former mutation, present in all 13 of 
the sequenced second-generation clones, is predicted to shorten the PI stem and expand the loop 
by two bases (Figure 6B). 

10 [0159] The solid-phase synthesis of the 1 8-base PNA stem-loop correspondhig to P2 and 
labelling its amino terminus with BFL was performed as before. Fluorescence polarization 
assays revealed that this truncated P2 PNA bound to papain with significantly improved affinity 
(K^ = 5 \xM) (Figure 6C). In addition, binding of truncated P2 to papain was inhibited by pre- 
incubation with a DNA 1 8-mer complementary to the P2 PNA sequence, but was not inhibited 

15 by a DNA 1 8-mer containing the P2-complementary codons in a scrambled order (Figure 6D), 
further suggesting that P2 secondary structure was required for papain binding activity. 

[0160] Collectively, these results represent the in vitro evolution of a purely synthetic 
polymer and demonstrate that template-directed, non-enzymatic synthesis can proceed with 
sufficient fidelity and efficiency to support laboratory evolution. These findings also establish 
20 that mixed-sequence PNA polymers can access conformations capable of selective binding to a 
target protein, even when a constrained genetic code (in this case, gvvt) is used to avoid 
frameshifting. 

[0161] The extent to which the P2 translation product was present in the initial library is 
unknown; it may have been underrepresented such that the initial library did not contain a 

25 sufficient number of P2 molecules to enable its emergence in the first selection. In addition, the 
initial four rounds of selection may not have sufficiently enriched the best papain binders to 
enable P2 to be represented at a detectable level within the sampled sequences, even though P2 
could be readily accessed in a smaller, focused library of PI variants. The emergence of a 
second-generation synthetic polymer with improved functional properties demonstrates the value 

30 of an additional round of mutagenesis, retranslation, and reselection even when theoretical 
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library sizes (3.5 x 10^ in this case) may not exceed the number of molecules that are created in a 
single library. 

Incorporation By Reference 

[0162] The entire contents of each of the publications, patents and patent applications cited 
5 herein are incorporated by reference into this application for all purposes. 

Equivalents 

[0163] The invention may be embodied in other specific forms without departing form the 
spirit or essential characteristics thereof. The foregoing embodiments are therefore to be 
considered in all respects illustrative rather than limiting on the invention described herein. 
10 Scope of the invention is thus indicated by the appended claims rather than by the foregoing 
description, and all changes that come within the meaning and range of equivalency of the 
claims are intended to be embraced therein. 
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Claims 

What is claimed is: 



1 1 . An in vitro method for evolving a polymer having a particular property, the method 

2 comprising the steps of: 

3 (a) producing a mixture of polymers, wherein each polymer is associated with a template 

4 oligonucleotide that encoded the synthesis of the polymer; 

5 (b) selecting from the mixture produced in step (a) a polymer having a particular 

6 property, wherein the selected polymer is associated with the template that encoded its synthesis; 

7 (c) obtaining information about the sequence of the template associated with the polymer 

8 selected in step (b); 

9 (d) producing a plurality of evolved templates, each of which differs by at least one base 

10 from the template associated with the polymer selected in step (b); 

1 1 (e) producing a mixture of evolved polymers using the evolved templates, wherein each 

12 evolved polymer is associated with the template that encoded its synthesis; and 

13 (f) selecting from the mixture produced in step (e) an evolved polymer having the 

1 4 particular property. 

1 2, The method of claim 1, wherein in step (a), the polymer is covalently attached to the 

2 template oligonucleotide. 

1 3. The method of claim 1 or 2, wherein in step (e), the polymer is covalently attached to the 

2 template oligonucleotide. 

1 4. The method of any one of claims 1-3, wherein the polymer is a PNA. 

1 5. The method of any one of claims 1-4, comprising the additional step of, after step (a) but 

2 before step (b), permitting the polymer to fold. 

1 6. The method of claim 2 or 3, wherein the polymer becomes substantially disassociated 

2 from the template. 

1 7. The method of claim 6, wherein an oligonucleotide complementary to the template 

2 disassociates the polymer from the template. 
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1 8. The method of any one of claims 1-6, wherein the property is catalytic activity, binding 

2 activity, solubility, or stability. 

1 9. The method of any one of claims 1-6, wherein the polymer produced in step (e) has a 

2 more desirable property than the polymer produced in step (a). 

1 10. An vitro method for evolving a polymer having a particular property, the method 

2 comprising: 

3 (a) combining (i) a plurality of different templates, wherein each template comprises a 

4 first codon and a second codon, with (ii) a plurality of transfer units, at least one of which 

5 comprises a monomeric subunit associated with an oligonucleotide having a first anti-codon 

6 capable of annealing to the first codon of a given template and at least one of which comprises a 

7 different monomeric subunit associated with an oligonucleotide comprising a second anti-codon 

8 capable of annealing to the second codon of a given template under conditions to permit transfer 

9 units to anneal to a particular template and to permit at least one monomer subunit to become 

10 covalenty linked to a different monomer subunit to produce a polymer associated with the 

1 1 template that encoded its synthesis; 

12 (b) selecting a polymer having a particular property, wherein the polymer remains 

13 associated with the template that encoded its synthesis; 

14 (c) obtaining sequence information about the template associated with the polymer 

15 selected in step (b); 

16 (d) obtaining a plurality of evolved templates that contain a codon that differs by at least 

17 one base from the template associated with the polymer selected in step (b); 

18 (e) combining (i) the plurality of evolved templates with (ii) a said plurality of transfer 

19 units under conditions to permit transfer units to anneal to a particular template and to permit a 

20 first monomer subunit to become covalently linked to a second monomer subunit to produce an 

21 evolved polymer associated with the evolved template that encoded its synthesis; and 

22 (e) selecting an evolved polymer having the particular property. 

1 11. The method of claim 10, wherein in step (a), the polymer is covalently attached to the 

2 template that encoded its synthesis. 
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1 12. The method of claim 10 or 1 1, wherein in step (e), the evolved polymer is covalently 

2 attached to the evolved template that encoded its synthesis. 

1 13. The method of any one of claims 10-12, wherein the polymer is a PNA. 

1 14. The method of any one of claims 10-13, comprising the additional step of, after step (a) 

2 but before step (b), permitting the polymer to fold. 

1 15. The method of claim 1 1 or 12, wherein the polymer is disassociated from the template 

2 that encoded its synthesis. 

1 16. The method of claim 1 5, wherein an oligonucleotide complementary to the template 

2 disassociates the polymer from the template. 

1 17. The method of any one of claims 10-16, wherein the property is catalytic activity, 

2 binding activity, solubility, or stability. 

1 18. The method of any one of claims 10-16, wherein the polymer produced in step (e) has a 

2 more desirable property than the polymer produced in step (a). 

1 19. A method of selecting a polymer capable of binding to a target molecule, the method 

2 comprising: 

3 (a) combining a plurality of polymers associated with oligonucleotide templates that 

4 encoded their synthesis with a solid support having the target molecule disposed thereon under 

5 conditions to permit poljoners to bind to the target molecule; 

6 (b) removing unbound polymers; 

7 (c) disassociating the bound polymers from the solid support to produce a first fraction 

8 enriched for polymers that bind to the target molecule; 

9 (d) combining the disassociated polymers with a fresh solid support having the target 

1 0 molecule disposed therein under conditions to permit polymers to bind to the target molecule; 

1 1 (e) removing unbound polymers; and 

12 (f) disassociating the bound polymers from the solid support to provide a second fraction 

13 enriched for polymers that bind to the target molecule, wherein the second fraction contains a 

14 greater proportion of polymers that bind to the target than the first fraction. 
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1 20. The method of claim 19, wherein in step (d) fresh solid support is combined with the 

2 disassociated polymer in the presence of the solid support used in step (a). 

1 21. The method of claim 19 or 20, wherein the polymer is a PNA. 

1 22. The method of any one of claims 1-21, wherein the polymer is a non-naturally occurring 

2 polymer. 

1 23. The method of any one of claims 1-21, wherein the polymer is not a biological polymer. 

1 24. The method of any one of claims 1-23, wherein the template is an oligonucleotide. 
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